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Description 

RELATED APPLICATIONS 

5 This application claims benefit of U.S. Provisional Application Number 60/027,032, filed September 24, 1 996. 

FIELD OF THE INVENTION 

This invention relates to newly identified polynucleotides and polypeptides, and their production and uses, as well 
10 as their variants, agonists and antagonists, and their uses. In particular, in these and in other regards, the invention 
relates to novel polynucleotides and polypeptides set forth in the Sequence Listing. 

BACKGROUND OF THE INVENTION 

is The Staphylococci make up a medically important genera of microbes. They are known to produce two types of 

disease, invasive and toxigenic. Invasive infections are characterized generally by abscess formation effecting both 
skin surfaces and deep tissues. Stapylococcus aureus (herein *S. aureus") is the second leading cause of bacteremia 
in cancer patients. Osteomyelitis, septic arthritis, septic thrombophlebitis and acute bacteria! endocarditis are also 
relatively common. There are at least three clinical conditions resulting from the toxigenic properties of Staphylococci. 

20 The manifestation of these diseases result from the actions of exotoxins as opposed to tissue invasion and bacteremia. 
These conditions include: Staphylococcal food poisoning, scalded skin syndrome and toxic shock syndrome. 

While certain Staphylococcal proteins associated with pathogenicity have been identified, e.g., coagulase, hemo- 
lysins, leucocidins and exo and enterotoxins, very little is known concerning the temporal expression of such genes 
during infection and disease progression in a mammalian host. Discovering the sets of genes the bacterium is likely 

25 to be expressing at the different stages of infection, particularly when an infection is established, provides critical in- 
formation for the screening and characterization of novel antibacterials which can interrupt pathogenesis. In addition 
to providing a fuller understanding of known proteins, such an approach will identify previously unrecognised targets. 

GUG is used as an initating nucleotide, rather than ATG, for a significant number of mRNA's in both Gram positive 
and Gram negative bacteria. Statistics on the frequency of NTG codons in the start codon for several bacterial species 

30 are available on line via computer (http:llbiochem.otago.ac.nz:BOO/Transterm/home_page.html). 

A discussion of initiation codons in B. subtilis is set forth in Vellanoweth, RL.1993 in Bacillus subtilis and other Gram 
Positive Bacteria, Biochemistry, Physiology and Molecular Genetics, Sonenshein, Hoch, LosickEds. Amer. Soc. Micro- 
biol, Washington DC. p. 699-711. Vellenworth indicates a major difference between B. subtilis and the gram-negative 
organisms is in the choice of initiation codon. 91% of the sequenced E coli genes start with AUG. By contrast, about 

35 30% of B. subtilis and other clostridial branch gened start with UUG or GUG. Moreover, CUG functions as a start codon 
in B. subtilis. Mutations of an AUG initiation codon to GUG or UUG often cause decreased expression in B. subtilis 
and E coli Generally, translation efficiency is higher with AUG initiation codons. A strong Shine-Delgarno ribosome 
binding site, however, can compensate almost fully for a weak initiation codon. It has been reported that genes with a 
range of expression levels have initiation codons other than ATG in gram positives (Vellanoweth, RL1993 in Bacillus 

40 subtilis and other Gram Positive Bacteria, Biochemistry, Physiology and Molecular Genetics, Son enshe in , Hoch , Los ic k 
Eds. Amer. Soc. Microbiol, Washington DC. p. 699-711). 

Provided herein are ORF sequences from genes possessing GUG initiation codons and proteins expressed there- 
from to be used for screening for antimicrobial compounds. Clearly, there is a need for polypeptide and polynucleotide 
sequences that may be used to screen for antimicrobial compound and which may also be used to determine the roles 

45 of such sequences in pathogenesis of infection, dysfunction and disease. There is also need, therefore, for identification 
and characterization of such sequences which may play a role in preventing, ameliorating or correcting infections, 
dysfunctions or diseases. 

The polypeptides of the invention have amino acid sequence homology to a known protein(s) as set forth in Table 1 . 
50 SUMMARY OF THE INVENTION 

It is an object of the invention to provide polypeptides that have been identified as novel polypeptides by homology 
between an amino acid sequence selected from the group consisting of the sequences set out in the Sequence Listing 
and a known amino acid sequence or sequences of other proteins such as the protein identities listed in Table 1 . 
55 it is a further object of the invention to provide polynucleotides that encode novel polypeptides, particularly poly- 

nucleotides that encode polypeptides of Staphylococcus aureus. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a region encoding a polypep- 
tide comprising a sequence sequence selected from the group consisting of the sequences set out in the Sequence 
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Listing, or a variant of any of these sequences. 

In another particularly preferred embodiment of the invention there is a novel protein from Staphylococcus aureus 
comprising an amino acid sequence selected from the group consisting of the sequences set out in the Sequence 
Listing, or a variant of any of these sequences, 
s In accordance with another aspect of the invention there is provided an isolated nucleic acid molecule encoding 

a mature polypeptide expressible by the Staphylococcus aureus WCUH29 strain contained in the deposited strain. 

A further aspect of the invention there are provided isolated nucleic acid molecules encoding a polypeptide of the 
invention, particularly Staphylococcus aureus polypeptide, and including mRNAs, cDNAs, genomic DNAs. Further 
embodiments of the invention include biologically, diagnostically, prophylactically, clinically or therapeutically useful 
10 variants thereof, and compositions comprising the same. 

In accordance with another aspect of the invention, there is provided the use of a polynucleotide of the invention 
for therapeutic or prophylactic purposes, in particular genetic immunization. Among the particularly preferred embod- 
iments of the invention are naturally occurring allelic variants of a polypeptide of the invention and polypeptides encoded 
thereby. 

15 Another aspect of the invention there are provided novel polypeptides of Staphylococcus aureus as well as bio- 

logically, diagnostically, prophylactically, clinically or therapeutically useful variants thereof, and compositions compris- 
ing the same. 

Among the particularly preferred embodiments of the invention are variants of the polypeptides of the invention 
encoded by naturally occurring alleles of their genes. 
20 in a preferred embodiment of the invention there are provided methods for producing the aforementioned polypep- 

tides. 

In accordance with yet another aspect of the invention, there are provided inhibitors to such polypeptides, useful 
as antibacterial agents, including, for example, antibodies. 

In accordance with certain preferred embodiments of the invention, there are provided products, compositions and 
25 methods for assessing expression of the polypeptides and polynucleotides of the invention, treating disease, for ex- 
ample, infections of the upper respiratory tract (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower 
respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diar- 
rhoea, splenic absces, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, 
keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), kidney and urinary tract (e.g., epididymitis, 
30 intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cel- 
lulitis, wound infection, bacterial myositis) bone and joint (e.g., septic arthritis, osteomyelitis), assaying genetic variation, 
and administering a polypeptide or polynucleotide of the invention to an organism to raise an immunological response 
against a bacteria, especially a Staphylococcus aureus bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the invention there are provided 
35 polynucleotides that hybridize to a polynucleotide sequence of the invention, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against polypeptides of the inven- 
tion. 

In other embodiments of the invention there are provided methods for identifying compounds which bind to or 
otherwise interact with and inhibit or activate an activity of a polypeptide or polynucleotide of the invention comprising: 
40 contacting a polypeptide or polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide to assess the binding to 
or other interaction with the compound, such binding or interaction being associated with a second component capable 
of providing a detectable signal in response to the binding or interaction of the polypeptide or polynucleotide with the 
compound; and determining whether the compound binds to or otherwise interacts with and activates or inhibits an 
45 activity of the polypeptide or polynucleotide by detecting the presence or absence of a signal generated from the binding 
or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided agonists and antagonists of the polypep- 
tides and polynucleotides of the invention, preferably bacteriostatic or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a polynucleotide or a polypeptide 
50 of the invention for administration to a cell or to a multicellular organism. 

Various changes and modifications within the spirit and scope of the disclosed invention will become readily ap- 
parent to those skilled in the art from reading the following descriptions and from reading the other parts of the present 
disclosure. 



55 GLOSSARY 



The following definitions are provided to facilitate understanding of certain terms used frequently herein. 

"Host cell" is a cell which has been transformed or transfected, or is capable of transformation or transfection by 
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an exogenous polynucleotide sequence. 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences or two or more poly- 
nucleotide sequences, as determined by comparing the sequences. In the art, "identity" also means the degree of 
sequence relatedness between polypeptide or polynucleotide sequences, as the case may be, as determined by the 

5 match between strings of such sequences. "Identity" and "similarity" can be readily calculated by known methods, 
including but not limited to those described in (Computational Molecular Biology, Lesk, A.M., ed. t Oxford University 
Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed. t Academic Press, New 
York, 1993; Computer Analysis of Sequence Data, Pari I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New 
Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1 987; and Sequence Analysis 

io Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991 ; and Carillo, K, and Lipman, D., SI AM 
J. Applied Math., 48: 1073 (1988). Preferred methods to determine identity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer 
programs. Preferred computer program methods to determine identity and similarity between two sequences include, 
but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)), 

is BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., J. Molec. Biol. 215: 403-410 (1990). The BLAST X program is 
publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al., NCB! NLM NIH Bethesda, MD 
20894; Altschul, S., etal., J. Mol. Biol. 275:403-410 (1 990). As an illustration, by a polynucleotide having a nucleotide 
sequence having at least, for example, 95% "identity" to a reference nucleotide sequence it is intended that the nucle- 
otide sequence of the tested polynucleotide is identical to the reference sequence except that the polynucleotide se- 

20 quence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide se- 
quence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, 
or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference 
sequence. These mutations of the reference sequence may occur at the 5" or 3' terminal positions of the reference 

25 nucleotide sequence or anywhere between those terminal positions, interspersed either individually among nucleotides 
in the reference sequence or in one or more contiguous groups within the reference sequence. Analogously , by a 
polypeptide having an amino acid sequence having at least, for example, 95% identity to a reference amino acid 
sequence is intended that the test amino acid sequence of the polypeptide is identical to the reference sequence except 
that the polypeptide sequence may include up to five amino acid alterations per each 1 00 amino acids of the reference 

30 amino acid. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a reference 
amino acid sequence, up to 5% of the amino acid residues in the reference sequence may be deleted or substituted 
with another amino acid, or a number of amino acids up to 5% of the total amino acid residues in the reference sequence 
may be inserted into the reference sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, in- 

35 terspersed either individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in nature, it has been changed 
or removed from its original environment, or both. For example, a polynucleotide or a polypeptide naturally present in 
a living organism is not "isolated," but the same polynucleotide or polypeptide separated from the coexisting materials 

40 of its natural state is "isolated", as the term is employed herein. 

"Polynucleotide(s)" generally refers to any polyribonucleotide or polydeoxribonucleotide, which may be unmodified 
RNA or DNA or modified RNA or DNA. "Polynucleotide(s)" include, without limitation, single- and double-stranded 
DNA, DNA that is a mixture of single- and double-stranded regions or single-, double- and triple-stranded regions, 
single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 

45 comprising DNA and RNA that may be single-stranded or, more typically, double-stranded, or triple-stranded regions, 
or a mixture of single- and double-stranded regions. In addition, "polynucleotide" as used herein refers to triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the same molecule 
or from different molecules. The regions may include all of one or more of the molecules, but more typically involve 
only a region of some of the molecules. One of the molecules of a triple-helical region often is an oligonucleotide. As 

50 used herein, the term 'polynucleotide(s)" also includes DNAs or RNAs as described above that contain one or more 
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are 'polynucleotide(s) 
" as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified 
bases, such as tritylated bases, to name just two examples, are polynucleotides as the term is used herein. It will be 
appreciated that a great variety of modifications have been made to DNA and RNA that serve many useful purposes 

55 known to those of skill in the art. The term "polynucleotide(s)" as it is employed herein embraces such chemically, 
enzymatically or metabolically modified forms of polynucleotides, as well as the chemical forms of DNA and RNA 
characteristic of viruses and cells, including, for example, simple and complex cells. "Polynucleotide(s)" also embraces 
short polynucleotides often referred to as oligonucleotide(s). 
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"Polypeptide(s)' refers to any peptide or protein comprising two or more amino acids joined to each other by peptide 
bonds or modified peptide bonds. °Polypeptide(s)" refers to both short chains, commonly referred to as peptides, oli- 
gopeptides and oligomers and to longer chains generally referred to as proteins. Polypeptides may contain amino acids 
other than the 20 gene encoded amino acids. "Polypeptide(s)" include those modified either by natural processes, 
5 such as processing and other post-trans lational modifications, but also by chemical modification techniques. Such 
modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research 
literature, and they are well known to those of skill in the art. It will be appreciated that the same type of modification 
may be present in the same or varying degree at several sites in a given polypeptide. Also, a given polypeptide may 
contain many types of modifications. Modifications can occur anywhere in a polypeptide, including the peptide back- 
to bone, the amino acid side-chains, and the amino or carboxyl termini. Modifications include, for example, acetylation, 
acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment 
of phosphot idyl inositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross- 
links, formation of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
is formation, hydroxylation, iodination, methylatton, myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, 
hydroxylation and ADP-ribosylation, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to pro- 
teins, such as arginylation, and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1 993) and Wold, R, Posttranslational 
20 Protein Modifications: Perspectives and Prospects, pgs. 1 -1 2 in POSTTRANSLATIONAL COVALENT MODIFICATION 
OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. EnzymoL 782626-646 
(1 990) and Rattan et al., Protein Synthesis: Posttranslational Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 
(1992). Polypeptides may be branched or cyclic, with or without branching. Cyclic, branched and branched circular 
polypeptides may result from post-translational natural processes and may be made by entirely synthetic methods, as 
25 well. 

' Variant(s)" as the term is used herein, is a polynucleotide or polypeptide that differs from a reference polynucleotide 
or polypeptide respectively, but retains essential properties. A typical variant of a polynucleotide differs in nucleotide 
sequence from another, reference polynucleotide. Changes in the nucleotide sequence of the variant may or may not 
alter the amino acid sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes may 

30 result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the ref- 
erence sequence, as discussed below A typical variant of a polypeptide differs in amino acid sequence from another, 
reference polypeptide. Generally, differences are limited so that the sequences of the reference polypeptide and the 
variant are closely similar overall and, in many regions, identical. A variant and reference polypeptide may differ in 
amino acid sequence by one or more substitutions, additions, deletions in any combination. A substituted or inserted 

35 amino acid residue may or may not be one encoded by the genetic code. A variant of a polynucleotide or polypeptide 
may be a naturally occurring such as an allelic variant, or it may be a variant that is not known to occur naturally. Non- 
naturally occurring variants of polynucleotides and polypeptides may be made by mutagenesis techniques, by direct 
synthesis, and by other recombinant methods known to skilled artisans. 

40 DESCRIPTION OF THE INVENTION 

Each of polynucleotide and polypeptide sequences provided herein may be used in the discovery and development 
of antibacterial compounds. Upon expression of the sequences with the appropriate initiation and termination codons 
the encoded polypeptide can be used as a target for the screening of antimicrobial drugs. Additionally, the DNA se- 

45 quences encoding preferably the amino terminal regions of the encoded protein or the Shine-Delgarno region can be 
used to construct antisense sequences to control the expression of the coding sequence of interest. Furthermore, 
many of the sequences disclosed herein also provide regions upstream and downstream from the encoding sequence. 
These sequences are useful as a source of regulatory elements for the control of bacterial gene expression. Such 
sequences are conveniently isolated by restriction enzyme action or synthesized chemically and introduced, for ex- 

50 ample, into promoter identification strains. These strains contain a reporter structural gene sequence located down- 
stream from a restriction site such that if an active promoter is inserted, the reporter gene will be expressed. 

Although each of the sequences may be employed as described above, this invention also provides several means 
for identifying particularly useful target genes. The first of these approaches entails searching appropriate databases 
for sequence matches in related organisms. Thus, if a homologue exists, the Streptococcal -like form of this gene would 

55 likely play an analogous role. For example, a Streptococcal protein identified as homologous to a cell surface protein 
in another organism would be useful as a vaccine candidate. To the extent such homologies have been identified for 
the sequences disclosed herein they are reported along with the encoding sequence. 

Each of the DNA sequences provided herein may be used in the discovery and development of antibacterial com- 
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pounds. Because each of the sequences contains an open reading frame (ORF) with an appropriate initiation and 
termination codons, the encoded protein upon expression can be used as a target for the screening of antimicrobial 
drugs. Additionally, the DNA sequences encoding the amino terminal regions of the encoded protein can be used to 
construct antisense sequences to control the expression of the coding sequence of interest. Furthermore, many of the 
5 sequences disclosed herein also provide regions upstream and downstream from the encoding sequence. These se- 
quences are useful as a source of regulatory elements for the control of bacterial gene expression. Such sequences 
are conveniently isolated by restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence located downstream from a 
restriction site such that if an active promoter is inserted, the reporter gene will be expressed. 
10 It is believed that bacteria possess a number of ways of regulating gene expression levels, especially in subtle 

degrees, and the interplay between ribosome binding site and inititation codon is utilized for this purpose for these 
genes. It is also believed that such genes will be important targets for antimicrobial drug discovery, particularly since 
pathogenesis genes are believed undergo gene expression regulation during in the pathogenesis process. Therefore, 
the invention provides ORF sequences possessing a GTG (GUG ) initiation codon and protein targets expressed there- 
in form. 

Although each of the sequences may be employed as described above, this invention also provides several means 
for identifying particularly useful target genes. The first of these approaches entails searching appropriate databases 
for sequence matches in related organisms. Thus, if a homologue exists, the Staphylococcal-like form of this gene 
would likely play an analogous role. For example, a Staphylococcal protein identified as homologous to a cell surface 
20 protein in another organism would be useful as a vaccine candidate. To the extent such homologies have been identified 
for the sequences disclosed herein they are reported along with the encoding sequence. 



ORF Gene Expression 



Recently techniques have become available to evaluate temporal gene expression in bacteria, particularly as it 
applies to viability under laboratory and infection conditions. A number of methods can be used to identify genes which 
are essential to survival perse, or essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and permits the selection of such an 
ORF for further development as a screening target. Briefly, these approaches include: 

1) Signature Tagged Mutagenesis (STM): This technique is described by Hensel et aj., Science 269: 400-403 
(1 995), the contents of which is incorporated by reference for background purposes. Signature tagged mutagenesis 
identifies genes necessary for the establishment/maintenance of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various means (e.g., transposons) 
such that unique DNA sequence tags are inserted in close proximity to the site of mutation. The tags from a mixed 
population of bacterial mutants and bacteria recovered from an infected hosts are detected by amplification, radi- 
olabeling and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the tag from the 
pool of bacteria recovered from infected hosts. 

In Staphylococcus aureus, because the transposon system is less well developed, a more efficient way of 
creating the tagged mutants is to use the insertion-duplication mutagenesis technique as described by Morrison 
et a I., J. Bacterid. 159:870 (1984) the contents of which is incorporated by reference for background purposes. 

2) In Vivo Expression Technology (I VET): This technique is described by Camilli et aj., Proc . Natl Acad . Sci . 
USA . 91 :2634-2638 (1994), the contents of which is incorporated by reference for background purposes. I VET 
identifies genes up-regulated during infection when compared to laboratory cultivation, implying an important role 
in infection. ORF identified by this technique are implied to have a significant role in infection establish mentAnain- 
tenance. 

In this technique random chromosomal fragments of target organism are cloned upstream of a promoter-less 
recombinase gene in a plasmid vector. This construct is introduced into the target organism which carries an 
antibiotic resistance gene flanked by resolvase sites. Growth in the presence of the antibiotic removes from the 
population those fragments cloned into the plasmid vector capable of supporting transcription of the recombinase 
gene and therefore have caused loss of antibiotic resistance. The resistant pool is introduced into a host and at 
various times after infection bacteria may be recovered and assessed for the presence of antibiotic resistance. 
The chromosomal fragment carried by each antibiotic sensitive bacterium should carry a promoter or portion of a 
gene normally upregulated during infection. Sequencing upstream of the recombinase gene allows identification 
of the up regulated gene. 

3) Differential display: This technique is described by Chuang et al, J. Bacteriol . 175:2026-2036 (1993), the 
contents of which is incorporated by reference for background purposes. This method identifies those genes which 
are expressed in an organism by identifying mRNA present using randomly-primed RT-PCR. By comparing pre- 
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infection and post infection profiles, genes up and down regulated during infection can be identified and the RT- 
PCR product sequenced and matched to ORF 'unknowns'. 

4) Generation of conditional lethal mutants by transposon mutagenesis: This technique, described by de 
Lorenzo, V. eta]., Gene 123:17-24 (1993); Neuwald, A. F. et al.. Gene 125: 69-73(1993); and Takiff, H. E. et al., 

5 J. Bacteriol . 174:1544-1553(1992), the contents of which is incorporated by reference for background purposes, 

identifies genes whose expression are essential for cell viability. 

In this technique transposons carrying controllable promoters, which provide transcription outward from the 
transposon in one or both directions, are generated. Random insertion of these transposons into target organisms 
and subsequent isolation of insertion mutants in the presence of inducer of promoter activity ensures that insertions 

10 which separate promoter from coding region of a gene whose expression is essential for cell viability will be re- 

covered. Subsequent replica plating in the absence of inducer identifies such insertions, since they fail to survive. 
Sequencing of the flanking regions of the transposon allows identification of site of insertion and identification of 
the gene disrupted. Close monitoring of the changes in cellular processes/morphology during growth in the absence 
of inducer yields information on likely function of the gene. Such monitoring could include flow cytometry (cell 

15 division, lysis, redox potential, DNA replication), incorporation of radiochemical ly labeled precursors into DNA, 

RNA, protein, lipid, peptidoglycan, monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

5) Generation of conditional lethal mutants by chemical mutagenesis: This technique is described by Beck- 
with, J., Methods in Enzvmology 204: 

20 3-1 8(1 991 ), the contents of which are incorporated herein by reference for background purposes. In this technique 

random chemical mutagenesis of target organism, growth at temperature other than physiological temperature 
(permissive temperature) and subsequent replica plating and growth at different temperature (e.g. 42"C to identify 
ts, 25"C to identify cs) are used to identify those isolates which now fail to grow (conditional mutants). As above 
close monitoring of the changes upon growth at the non-permissive temperature yields information on the function 

25 of the mutated gene. Complementation of conditional lethal mutation by library from target organism and sequenc- 

ing of complementing gene allows matching with unknown ORF. 

6) RT-PCR: Staphylococcus aureus messenger RNA is isolated from bacterial infected tissue e.g. 48 hour murine 
lung infections, and the amount of each mRNA species assessed by reverse transcription of the RNA sample 
primed with random hexanucleotides followed by PCR with gene specific primer pairs. The determination of the 

30 presence and amount of a particular mRNA species by quantification of the resultant PCR product provides infor- 

mation on the bacterial genes which are transcribed in the infected tissue. Analysis of gene transcription can be 
carried out at different times of infection to gain a detailed knowledge of gene regulation in bacterial pathogenesis 
allowing for a clearer understanding of which gene products represent targets for screens for novel antibacterials. 
Because of the gene specific nature of the PCR primers employed it should be understood that the bacterial mRNA 

35 preparation need not be free of mammalian RNA. This allows the investigator to carry out a simple and quick RNA 

preparation from infected tissue to obtain bacterial mRNA species which are very short lived in the bacterium (in 
the order of 2 minute halflives). Optimally the bacterial mRNA is prepared from infected murine lung tissue by 
mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short periods of time, subsequent process- 
ing according to the manufacturers of TRIzole reagent and DNAase treatment to remove contaminating DNA. 

40 Preferably the process is optimised by finding those conditions which give a maximum amount of Staphylococcus 

aureus 16S ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific oligonu- 
cleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair in a PCR reaction which is 
terminated optimally between B and 25 cycles. The PCR products are separated on 6% polyacrylamide gels with 
detection and quantification using GeneScanner (manufactured by ABI). 

45 

Each of these techniques may have advantages or disadvantage depending on the particular application. The 
skilled artisan would choose the approach that is the most relevant with the particular end use in mind. 

Use of the of these technologies when applied to the ORFs of the present invention enables identification of bac- 
terial proteins expressed during infection, inhibitors of which would have utility in anti-bacterial therapy. 
50 The invention relates to novel polypeptides and polynucleotides as described in greater detail below. In particular, 

the invention relates to polypeptides and polynucleotides of Staphylococcus aureus, which is related by amino acid 
sequence homology to known polypeptide as set forth in Table 1 . The invention relates especially to compounds having 
the nucleotide and amino acid sequence selected from the group consisting of the sequences set out in the Sequence 
Listing, and to the nucleotide sequences of the DNA in the deposited strain and amino acid sequences encoded thereby. 

55 

Deposited materials 

S. aureus WCUH 29 has been deposited at the National Collection of Industrial and Marine Bacteria Ltd. (NCI MB), 
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Aberdeen, Scotland under number NCIMB 40771 on 11 September 1995. 

The Staphylococcus aureus strain deposit is referred to herein as "the deposited strain" or as "the DNA of the 
deposited strain.' 

The deposited strain contains the full length genes comprising the polynucleotides set forth in the Sequence Listing. 
5 The sequence of the polynucleotides contained in the deposited strain, as well as the amino acid sequence of the 
polypeptide encoded thereby, are controlling in the event of any conflict with any description of sequences herein. 

The deposit of the deposited strain has been made under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Micro-organisms for Purposes of Patent Procedure. The strain will be irrevocably and 
without restriction or condition released to the public upon the issuance of a patent. The deposited strain is provided 
io merely as convenience to those of skill in the art and is not an admission that a deposit is required for enablement, 
such as that required under 35 U.S.C. §112. 

A license may be required to make, use or sell the deposited strain, and compounds derived therefrom, and no 
such license is hereby granted. 

15 Polypeptides 

The polypeptides of the invention include the polypeptides set forth in the Sequence Listing (in particular the mature 
polypeptide) as well as polypeptides and fragments, particularly those which have the biological activity of a polypeptide 
of the invention, and also those which have at least 50%, 60% or 70% identity to a polypeptide sequence selected 

20 from the group consisting of the sequences set out in the Sequence Listing or the relevant portion, preferably at least 
80% identity to a polypeptide sequence selected from the group consisting of the sequences set out in the Sequence 
Listing, and more preferably at least 90% similarity (more preferably at least 90% identity) to a polypeptide sequence 
selected from the group consisting of the sequences set out in the Sequence Listing, and still more preferably at least 
95% similarity (still more preferably at least 95% identity) to a polypeptide sequence selected from the group consisting 

25 of the sequences set out in the Sequence Listing, and also include portions of such polypeptides with such portion of 
the polypeptide generally containing at least 30 amino acids and more preferably at least 50 amino acids. 
The invention also includes polypeptides of the formula: 

30 X-(R 1 ) n -(R 2 )-(R 3 ) n -Y 

wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is hydrogen or a metal, R, and R 3 are 
any amino acid residue, n is an integer between 1 and 2000, and R 2 is an amino acid sequence of the invention, 
particularly an amino acid sequence selected from the group set forth in the Sequence Listing. In the formula above 
35 R 2 is oriented so that its amino terminal residue is at the left, bound to R-, and its carboxy terminal residue is at the 
right, bound to R 3 . Any stretch of amino acid residues denoted by either R group, where R is greater than 1 , may be 
either a heteropolymer or a homopotymer, preferably a heteropolymer. In preferred embodiments n is an integer be- 
tween 1 and 1000 or 2000. 

A fragment is a variant polypeptide having an amino acid sequence that entirely is the same as part but not all of 

40 the amino acid sequence of the aforementioned polypeptides. As with polypeptides, fragments may be "free-standing, 
" or comprised within a larger polypeptide of which they form a part or region, most preferably as a single continuous 
region, a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of the amino acid sequence of 
the Sequence Listing, or of variants thereof, such as a continuous series of residues that includes the amino terminus, 

45 or a continuous series of residues that includes the carboxyl terminus. Degradation forms of the polypeptides of the 
invention in a host cell, particularly a Staphylococcus aureus, are also preferred. Further preferred are fragments char- 
acterized by structural or functional attributes such as fragments that comprise alpha-helix and alpha-helix forming 
regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil-forming regions, hy- 
drophilic regions, hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 

50 forming regions, substrate binding region, and high antigenic index regions. 

Also preferred are biologically active fragments which are those fragments that mediate activities of polypeptides 
of the invention, including those with a similar activity or an improved activity, or with a decreased undesirable activity. 
Also included are those fragments that are antigenic or immunogenic in an animal, especially in a human. Particularly 
preferred are fragments comprising receptors or domains of enzymes that confer a function essential for viability of 

ss Staphylococcus aureus or the ability to initiate, or maintain cause disease in an individual, particularly a human. 

Variants that are fragments of the polypeptides of the invention may be employed for producing the corresponding 
full-length polypeptide by peptide synthesis; therefore, these variants may be employed as intermediates for producing 
the full-length polypeptides of the invention. 
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In addition to the standard single and triple letter representations for amino acids, the term 'X 0 or "Xaa" is also 
used. "X" and "Xaa" mean that any of the twenty naturally occuring amino acids may appear at such a designated 
position in the polypeptide sequence. 

s Polynucleotides 

The nucleotide sequences disclosed herein can be obtained by synthetic chemical techniques known in the art or 
can be obtained from S. aureus WCUH 29 by probing a DNA preparation with probes constructed from the particular 
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed sequence can act as PCR primers 

10 in a process of PCR-based cloning of the sequence from a bacterial genomic source. It is recognised that such se- 
quences will also have utility in diagnosis of the stage of infection and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given herein typically a library of clones 
of chromosomal DNA of S.aureus WCUH 29 in E. coli or some other suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones carrying DNA identical to that 

fs of the probe can then be distinguished using high stringency washes. By sequencing the individual clones thus identified 
with sequencing primers designed from the original sequence it is then possible to extend the sequence in both direc- 
tions to determine the full gene sequence. Conveniently such sequencing is performed using denatured double strand- 
ed DNA prepared from a plasmid clone. Suitable techniques are described by Maniatis, T, Fritsch, E.F. and Sambrook, 
J. in MOLECULAR CLONING, A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening 

20 By Hybridization 1 .90 and Sequencing Denatured Double-Stranded DNA Templates 1 3.70). 

Moerover, another aspect of the invention relates to isolated polynucleotides that encode the polypeptides of the 
invention having a deduced amino acid sequence selected from the group consisting of the sequences in the Sequence 
Listing and polynucleotides closely related thereto and variants thereof. 

Using the information provided herein, such as the polynucleotide sequences set out in the Sequence Listing, a 

25 polynucleotide of the invention encoding polypeptide may be obtained using standard cloning and screening methods, 
such as those for cloning and sequencing chromosomal DNA fragments from bacteria using Staphylococcus aureus 
WCUH29 cells as starting material, followed by obtaining a full length clone. For example, to obtain a polynucleotide 
sequence of the invention, such as a sequence set forth in the Sequence Listing, typically a library of clones of chro- 
mosomal DNA of Staphylococcus aureus WCUH29 in E.colior some other suitable host is probed with a radiolabeled 

30 oligonucleotide, preferably a 17-mer or longer, derived from a partial sequence. Clones carrying DNA identical to that 
of the probe can then be distinguished using stringent conditions. By sequencing the individual clones thus identified 
with sequencing primers designed from the original sequence it is then possible to extend the sequence in both direc- 
tions to determine the full gene sequence. Conveniently, such sequencing is performed using denatured double strand- 
ed DNA prepared from a plasmid clone. Suitable techniques are described by Maniatis, T, Fritsch, E.F. and Sambrook 

35 et al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1989). (see in particular Screening By Hybridization 1.90 and Sequencing Denatured Double- 
Stranded DNA Templates 1 3.70). Illustrative of the invention, the polynucleotides set out in the Sequence Listing were 
discovered in a DNA library derived from Staphylococcus aureus WCUH29. 

The DNA sequences set out in the Sequence Listing each contains at least one open reading frame encoding a 

40 protein having at least about the number of amino acid residues set forth in the Sequence Listing. The start and stop 
codons of each open reading frame (herein "ORF") DNA are the first three and the last three nuclotides of each poly- 
nucleotide set forth in the Sequence Listing. 

Certain polynucleotides and polypeptides of the invention are structurally related to known proteins as set forth in 
Table 1 . These proteins exhibit greatest homology to the homologue listed in Table 1 from among the known proteins. 

45 The invention provides a polynucleotide sequence identical over its entire length to each coding sequence in the 

Sequence Listing. Also provided by the invention is the coding sequence for the mature polypeptide or a fragment 
thereof, by itself as well as the coding sequence for the mature polypeptide or a fragment in reading frame with other 
coding sequence, such as those encoding a leader or secretory sequence, a pre-, or pro- or prepro- protein sequence. 
The polynucleotide may also contain non-coding sequences, including for example, but not limited to non-coding 5* 

50 and 3' sequences, such as the transcribed, non-translated sequences, termination signals, ribosome binding sites, 
sequences that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence which encode ad- 
ditional amino acids. For example, a marker sequence that facilitates purification of the fused polypeptide can be 
encoded. In certain embodiments of the invention, the marker sequence is a hexa-histidine peptide, as provided in the 
pQE vector (Qiagen, Inc.) and described in Gentz et al., Proc. Natl. Acad. Sci, USA 86: 821 -824 (1 989), or an HA tag 

55 (Wilson et al., Cell 37: 767 (1 984). Polynucleotides of the invention also include, but are not limited to, polynucleotides 
comprising a structural gene and its naturally associated sequences that control gene expression. 
The invention also includes polynucleotides of the formula: 
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X-(Rl)n-( R 2 M R 3)n- Y 

wherein, at the 5' end of the molecule, X is hydrogen, and at the 3' end of the molecule, Y is hydrogen or a metal, R, 
5 and R 3 is any nucleic acid residue, n is an integer between 1 and 3000, and R 2 is a nucleic acid sequence of the 
invention, particularly a nucleic acid sequence selected from the group set forth in the Sequence Listing. In the poly- 
nucleotide formula above R 2 is oriented so that its 5' end residue is at the left, bound to R^ and its 3' end residue is 
at the right, bound to R 3 . Any stretch of nucleic acid residues denoted by either R group, where R is greater than 1, 
may be either a heteropofymer or a homopolymer, preferably a heteropolymer In a preferred embodiment n is an 
io integer between 1 and 1000, or 2000 or 3000. 

The term "polynucleotide encoding a polypeptide" as used herein encompasses polynucleotides that include a 
sequence encoding a polypeptide of the invention, particularly a bacterial polypeptide and more particularly a polypep- 
tide of the Staphylococcus aureus having an amino acid sequence set out in the Sequence Listing. The term also 
encompasses polynucleotides that include a single continuous region or discontinuous regions encoding the polypep- 
15 tide (for example, interrupted by integrated phage or an insertion sequence or editing) together with additional regions, 
that also may contain coding and/or non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that encode for variants of the 
polypeptide having the deduced amino acid sequence of the Sequence Listing. Variants that are fragments of the 
polynucleotides of the invention may be used to synthesize full-length polynucleotides of the invention. 
20 Further particularly preferred embodiments are polynucleotides encoding polypeptide variants, that have the amino 

acid sequence of a polypeptide of the Sequence Listing in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino 
acid residues are substituted, deleted or added, in any combination. Especially preferred among these are silent sub- 
stitutions, additions and deletions, that do not alter the properties and activities of such polynucleotide. 

Further preferred embodiments of the invention are polynucleotides that are at least 50%, 60% or 70% identical 
25 over their entire length to a polynucleotide encoding a polypeptide having the amino acid sequence set out in the 
Sequence Listing, and polynucleotides that are complementary to such polynucleotides. Alternatively, most highly 
preferred are polynucleotides that comprise a region that is at least 80% identical over its entire length to a polynucle- 
otide encoding a polypeptide of the deposited strain and polynucleotides complementary thereto. In this regard, poly- 
nucleotides at least 90% identical over their entire length to the same are particularly preferred, and among these 
30 particularly preferred polynucleotides, those with at least 95% are especially preferred. Furthermore, those with at least 
97% are highly preferred among those with at least 95%, and among these those with at least 98% and at least 99% 
are particularly highly preferred, with at least 99% being the more preferred. 

A preferred embodiment is an isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: a polynucleotide having at least a 50% identity to a polynucleotide encoding a polypeptide com- 
35 prising the amino acid sequence of the Sequence Listing and obtained from a prokaryotic species other than S. aureus; 
and a polynucleotide encoding a polypeptide comprising an amino acid sequence which is at least 50% identical to 
the amino acid sequence of the Sequence Listing and obtained from a prokaryotic species other than S. aureus. 

Preferred embodiments are polynucleotides that encode polypeptides that retain substantially the same biological 
function or activity as the mature polypeptide encoded by the DNA of the Sequence Listing. 
40 The invention further relates to polynucleotides that hybridize to the herein above-described sequences. In this 

regard, the invention especially relates to polynucleotides that hybridize under stringent conditions to the herein above- 
described polynucleotides. As herein used, the terms "stringent conditions" and "stringent hybridization conditions" 
mean hybridization will occur only if there is at least 95% and preferably at least 97% identity between the sequences. 
An example of stringent hybridization conditions is overnight incubation at 42° C in a solution comprising: 50% forma- 
ts mide, 5x SSC (1 50mM NaCI, 1 5mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 1 0% 
dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed by washing the hybridization 
support in 0.1x SSC at about 65°C. Hybridization and wash conditions are well known and exemplified in Sambrook, 
era/., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), particularly Chapter 
11 therein. 

50 The invention also provides a polynucleotide consisting essentially of a polynucleotide sequence obtainable by 

screening an appropriate library containing the complete gene for a polynucleotide sequence set forth in the Sequence 
Listing under stringent hybridization conditions with a probe having the sequence of said polynucleotide sequence or 
a fragment thereof; and isolating said DNA sequence. Fragments useful for obtaining such a polynucleotide include, 
for example, probes and primers described elsewhere herein. 

55 As discussed additionally herein regarding polynucleotide assays of the invention, for instance, polynucleotides 

of the invention as discussed above, may be used as a hybridization probe for RNA, cDNAand genomic DNA to isolate 
full-length cDNAs and genomic clones encoding a polypeptide and to isolate cDNA and genomic clones of other genes 
that have a high sequence similarity to a polynucleotide set forth in the Sequence Listing. Such probes generally will 
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comprise at least 15 bases. Preferably, such probes will have at least 30 bases and may have at least 50 bases. 
Particularly preferred probes will have at least 30 bases and will have 50 bases or less. 

For example, the coding region of each gene that comprises or is comprised by a polynucleotide set forth in the 
Sequence Listing may be isolated by screening using a DNA sequence provided in the Sequence Listing to synthesize 
5 an oligonucleotide probe. A labeled oligonucleotide having a sequence complementary to that of a gene of the invention 
is then used to screen a library of cDNA, genomic DNA or mRNA to determine which members of the library the probe 
hybridizes to. 

The polynucleotides and polypeptides of the invention may be employed, for example, as research reagents and 
materials for discovery of treatments of and diagnostics for disease, particularly human disease, as further discussed 
10 herein relating to polynucleotide assays. 

Polynucleotides of the invention that are oligonucleotides derived from the a polynucleotide or polypeptide se- 
quence set forth in the Sequence Listing may be used in the processes herein as described, but preferably for PGR, 
to determine whether or not the polynucleotides identified herein in whole or in part are transcribed in bacteria in infected 
tissue. It is recognized that such sequences will also have utility in diagnosis of the stage of infection and type of 
is infection the pathogen has attained. 

The invention also provides polynucleotides that may encode a polypeptide that is the mature protein plus additional 
amino or carboxyl-terminal amino acids, or amino acids interior to the mature polypeptide (when the mature form has 
more than one polypeptide chain, for instance). Such sequences may play a role in processing of a protein from pre- 
cursor to a mature form, may allow protein transport, may lengthen or shorten protein half -life or may facilitate manip- 
20 ulation of a protein for assay or production, among other things. As generally is the case in vivo, the additional amino 
acids may be processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more prosequences may be an 
inactive form of the polypeptide. When prosequences are removed such inactive precursors generally are activated. 
Some or all of the prosequences may be removed before activation. Generally, such precursors are called proproteins. 
25 in addition to the standard A, G, C, T/U representations for nucleic acid bases, the term "NT is also used. "N" means 

that any of the four DNA or RNA bases may appear at such a designated position in the DNA or RNA sequence, except 
it is preferred that N is not a base that when taken in combination with adjacent nucleotide positions, when read in the 
correct reading frame, would have the effect of generating a premature termination codon in such reading frame. 

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein plus a leader sequence 
30 (which may be referred to as a preprotein), a precursor of a mature protein having one or more prosequences that are 
not the leader sequences of a preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader 
sequence and one or more prosequences, which generally are removed during processing steps that produce active 
and mature forms of the polypeptide. 

35 Vectors, host cells, expression 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of the invention, host cells 
that are genetically engineered with vectors of the invention and the production of polypeptides of the invention by 
recombinant techniques. Cell-free translation systems can also be employed to produce such proteins using RNAs 

40 derived from the DNA constructs of the invention. 

For recombinant production, host cells can be genetically engineered to incorporate expression systems or portions 
thereof or polynucleotides of the invention. Introduction of a polynucleotide into the host cell can be effected by methods 
described in many standard laboratory manuals, such as Davis et al., BASIC METHODS IN MOLECULAR BIOLOGY, 
(1986) and Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Lab- 

45 oratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate transfection, DEAE-dextran mediated 
transfection, transvection, microinjection, cation ic lip id-mediated transfection, electroporation, transduction, scrape 
loading, ballistic introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as staphylococci, staphylococci, ente- 
rococci E. coli, streptomyces and Bacillus subtilis cells; fungal cells, such as yeast cells and Aspergillus cells; insect 

so cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 
and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used to produce the polypeptides of the invention. Such vectors 
include, among others, chromosomal, episomal and virus-derived vectors, e.g., vectors derived from bacterial plasm ids, 
from bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal ele- 

55 ments, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox 
viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived 
from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The expression system constructs 
may contain control regions that regulate as well as engender expression. Generally, any system or vector suitable to 
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maintain, propagate or express polynucleotides and/or to express a polypeptide in a host may be used for expression 
in this regard. The appropriate DNA sequence may be inserted into the expression system by any of a variety of well- 
known and routine techniques, such as, for example, those set forth in Sambrook et al., MOLECULAR CLONING, A 
LABORATORY MANUAL, (supra). 

5 For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the periplasmic space or 

into the extracellular environment, appropriate secretion signals may be incorporated into the expressed polypeptide. 
These signals may be endogenous to the polypeptide or they may be heterologous signals. 

Polypeptides of the invention can be recovered and purified from recombinant cell cultures by well-known methods 
including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phos- 

10 phocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxy lapatite chro- 
matography, and lectin chromatography. Most preferably, high performance liquid chromatography is employed for 
purification. Well known techniques for refolding protein may be employed to regenerate active conformation when the 
polypeptide is denatured during isolation and or purification. 

is Diagnostic Assays 

This invention is also related to the use of the polynucleotides of the invention for use as diagnostic reagents. 
Detection of such polynucleotides in a eukaryote, particularly a mammal, and especially a human, will provide a diag- 
nostic method for diagnosis of a disease. Eukaryotes (herein also •individual(s)"), particularly mammals, and especially 
20 humans, infected with an organism comprising a gene of the invention may be detected at the nucleic acid level by a 
variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and tissues, such as bone, blood, 
muscle, cartilage, and skin. Genomic DNA may be used directly for detection or may be amplified enzymatically by 
using PCR or other amplification technique prior to analysis. RNA or cDNA may also be used in the same ways. Using 

25 amplification, characterization of the species and strain of prokaryote present in an individual, may be made by an 
analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a change in size of the 
amplified product in comparison to the genotype of a reference sequence. Point mutations can be identified by hybrid- 
izing amplified DNA to labeled polynucleotide sequences of the invention. Perfectly matched sequences can be dis- 
tinguished from mismatched duplexes by RNase digestion or by differences in melting temperatures. DNA sequence 

30 differences may also be detected by alterations in the electrophoretic mobility of the DNA fragments in gels, with or 
without denaturing agents, or by direct DNA sequencing. See, e.g., Myers et al., Science, 230: 1242 (1985). Sequence 
changes at specific locations also may be revealed by nuclease protection assays, such as RNase and S1 protection 
or a chemical cleavage method. See, e.g., Cotton et al., Proc. Natl. Acad. Sci, USA, 55:4397-4401 (1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be detected at the DNA level by 

35 a variety of techniques, to allow for serotyping, for example. For example, RT-PCR can be used to detect mutations. 
It is particularly preferred to used RT-PCR in conjunction with automated detection systems, such as, for example, 
GeneScan. RNA or cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify and analyze mutations. 
These primers may be used for, among other things, amplifying a DNA of the invention isolated from a sample derived 

40 from an individual. The primers may be used to amplify the gene isolated from an infected individual such that the gene 
may then be subject to various techniques for elucidation of the DNA sequence. In this way, mutations in the DNA 
sequence may be detected and used to diagnose infection and to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing disease, preferably bacterial infections, more preferably 
infections by Staphylococcus aureus, and most preferably disease, such as, infections of the upper respiratory tract 

*s (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), 
cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic absces, retroperitoneal ab- 
scess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, keratitis, endophthalmitis, preseptal and 
orbital cellulitis, dacryocystitis), kidney and urinary tract (e.g., epididymitis, intrarenal and perinephric absces, toxic 
shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) 

50 bone and joint (e.g., septic arthritis, osteomyelitis), comprising determining from a sample derived from an individual 
a increased level of expression of polynucleotide having the sequence of the Sequence Listing. Increased or decreased 
expression of a polynucleotide of the invention can be measured using any on of the methods well known in the art 
for the quantitation of polynucleotides, such as, for example, amplification, PCR, RT-PCR, RNase protection, Northern 
blotting and other hybridization methods. 

55 in addition, a diagnostic assay in accordance with the invention for detecting over-expression of a polypeptide of 

the invention compared to normal control tissue samples may be used to detect the presence of an infection, for 
example. Assay techniques that can be used to determine levels of a protein, in a sample derived from a host are well- 
known to those of skill in the art. Such assay methods include radioimmunoassays, competitive-binding assays, West- 
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em Blot analysis and ELISA assays. 
Antibodies 

s The polypeptides of the invention or variants thereof, or cells expressing them can be used as an immunogen to 

produce antibodies immunospecific for such polypeptides. "Antibodies" as used herein includes monoclonal and pol- 
yclonal antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, as well as Fab fragments, 
including the products of an Fab immunoglobulin expression library. 

Antibodies generated against the polypeptides of the invention can be obtained by administering the polypeptides 
10 or epitope-bearing fragments, analogues or cells to an animal, preferably a nonhuman, using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art that provides antibodies produced by continuous 
cell line cultures can be used. Examples include various techniques, such as those in Kohler, G. and Milstein, C, 
Nature 256: 495-497 (1 975); Kozbor et at., Immunology Today 4: 72 (1 983); Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985). 
15 Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be adapted to produce 

single chain antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such as other mam- 
mals, may be used to express humanized antibodies. 

Alternatively phage display technology may be utilized to select antibody genes with binding activities towards the 
polypeptide either from repertoires of PCR amplified v-genes of lymphocytes from humans screened for possessing 
20 recognition of a polypeptide of the invention or from naive libraries (McCafferty, J. et a!., (1 990), Nature 348, 552-554; 
Marks, J. et al., (1992) Biotechnology 10, 779-783). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et al., (1991) Nature 352, 624-628). 

If two antigen binding domains are present each domain may be directed against a different epitope - termed 
'bispecific' antibodies. 

25 The above-described antibodies may be employed to isolate or to identify clones expressing the polypeptides to 

purify the polypeptides by affinity chromatography. 

Thus, among others, antibodies against a polypeptide of the invention may be employed to treat infections, par- 
ticularly bacterial infections and especially infections of the upper respiratory tract (e.g., otitis media, bacterial tracheitis, 
acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endocarditis), 

30 gastrointestinal (e.g., secretory diarrhoea, splenic absces, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye 
(e.g.. blepharitis, conjunctivitis, keratitis, endophthalmitis, preseptal and orbital cellulitis, dacryocystitis), kidney and 
urinary tract (e.g., epididymitis, intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., impetigo, follicu- 
litis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) bone and joint (e.g., septic arthritis, osteomy- 
elitis). 

35 Polypeptide variants include antigen icalfy, epitopically or immunologically equivalent variants that form a particular 

aspect of this invention. The term "antigen ically equivalent derivative" as used herein encompasses a polypeptide or 
its equivalent which will be specifically recognized by certain antibodies which, when raised to the protein or polypeptide 
according to the invention, interfere with the immediate physical interaction between pathogen and mammalian host. 
The term "immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent which when 

40 used in a suitable formulation to raise antibodies in a vertebrate, the antibodies act to interfere with the immediate 
physical interaction between pathogen and mammalian host. 

The polypeptide, such as an antigen ically or immunologically equivalent derivative or a fusion protein thereof is 
used as an antigen to immunize a mouse or other animal such as a rat or chicken. The fusion protein may provide 
stability to the polypeptide. The antigen may be associated, for example by conjugation, with an immunogenic carrier 

45 protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). Alternatively a multiple anti- 
genic peptide comprising multiple copies of the protein or polypeptide, or an antigenically or immunologically equivalent 
polypeptide thereof may be sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the individual. For example, 
if the individual is human the antibody may most preferably be "humanized"; where the complimentarity determining 

50 region(s) of the hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for example 
as described in Jones, P. et al. (1986), Nature 321, 522-525 or Tempest et al.,(1991) Biotechnology 9, 266-273. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a suitable delivery 
method such as direct injection of plasmid DNA into muscles (Wolff et al., Hum Mol Genet 1 992, 1 :363, Manthorpe et 
al., Hum. Gene Ther. 1963:4, 419), delivery of DNA complexed with specific protein carriers (Wu et al., J Biol Chem. 

55 1989: 264,16985), coprecipitation of DNA with calcium phosphate (Benvenisty & Reshef, PNAS, 1986:83,9551), en- 
capsulation of DNA in various forms of liposomes (Kaneda et al., Science 1989:243,375), particle bombardment (Tang 
et al., Nature 1992, 356:152, Eisenbraun et al., DNA Cell Biol 1993, 12:791) and in vivo infection using cloned retroviral 
vectors (Seeger et al., PNAS 1984:81,5849). 
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Antagonists and agonists - assays and molecules 

Polypeptides of the invention may also be used to assess the binding of small molecule substrates and ligands in, 
for example, cells, cell-free preparations, chemical libraries, and natural product mixtures. These substrates and ligands 
5 may be natural substrates and ligands or may be structural or functional mimetics. See, e.g., Coligan at ai, Current 
Protocols in Immunology 1(2): Chapter 5 (1 991 ). 

The invention also provides a method of screening compounds to identify those which enhance (agonist) or block 
(antagonist) the action of a polypeptides or polynucleotides of the invention, particularly those compounds that are 
bacteriostatic and/or bacteriocidal. The method of screening may involve high -throughput techniques. For example, 
io to screen for agonists or antagoists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell en- 
velope or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a labeled substrate 
or ligand of such polypeptide is incubated in the absence or the presence of a candidate molecule that may be an 
agonist or antagonist of a polypeptide of the invention. The ability of the candidate molecule to agonize or antagonize 
a polypeptide of the invention is reflected in decreased binding of the labeled ligand or decreased production of product 
is from such substrate. Molecules that bind gratuitously, i.e., without inducing the effects of a polypeptide of the invention 
are most likely to be good antagonists. Molecules that bind well and increase the rate of product production from 
substrate are agonists. Detection of the rate or level of production of product from substrate may be enhanced by using 
a reporter system. Reporter systems that may be useful in this regard include but are not limited to colorimetric labeled 
substrate converted into product, a reporter gene that is responsive to changes in polynucleotide or polypeptide activity, 
20 and binding assays known in the art. 

Another example of an assay for antagonists of polypeptides of the invention is a competitive assay that combines 
any such polypeptide and a potential antagonist with a compound which binds such polypeptide, natural substrates or 
ligands, or substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. A polypeptide 
of the invention can be labeled, such as by radioactivity or a colorimetric compound, such that the number of such 
25 polypeptide molecules bound to a binding molecule or converted to product can be determined accurately to assess 
the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that bind to a poly- 
nucleotide or polypeptide of the invention and thereby inhibit or extinguish its activity. Potential antagonists also may 
be small organic molecules, a peptide, a polypeptide such as a closely related protein or antibody that binds the same 
30 sites on a binding molecule, such as a binding molecule, without inducing activities induced by a polypeptide of the 
invention, thereby preventing the action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding site of the polypeptide thereby 
preventing binding to cellular binding molecules, such that normal biological activity is prevented. Examples of small 
molecules include but are not limited to small organic molecules, peptides or peptide-like molecules. Other potential 
35 antagonists include antisense molecules (see Okano, J. Neurochem. 56:560 (1991); OLIGODEOXYNUCLEOTIDES 
AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC Press, Boca Raton, FL (1 988), for a description of these 
molecules). Preferred potential antagonists include compounds related to and variants of a polypeptide of the invention. 

Each of the DNA sequences provided herein may be used in the discovery and development of antibacterial com- 
pounds. The encoded protein, upon expression, can be used as a target for the screening of antibacterial drugs. Ad- 
40 ditionally, the DNA sequences encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense sequences to control the 
expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor of the invention to interfere with 
the initial physical interaction between a pathogen and mammalian host responsible for sequelae of infection. In par- 
45 ticular the molecules of the invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to extracellular matrix proteins in wounds; 
to block protein-mediated mammalian cell invasion by, for example, initiating phosphorylation of mammalian tyrosine 
kinases (Rosenshine etai, Infect Immun. 6(7.2211 (1992); to block bacterial adhesion between mammalian extracel- 
lular matrix proteins and bacterial proteins that mediate tissue damage and; to block the normal progression of patho- 
50 genesis in infections initiated other than by the implantation of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit and treat disease, such 
as, infections of the upper respiratory tract (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower 
respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diar- 
rhoea, splenic absces, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, 
55 keratitis, endophthalmitis, preseptal and orbital cellulitis, dacryocystitis), kidney and urinary tract (e.g., epididymitis, 
intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cel- 
lulitis, wound infection, bacterial myositis) bone and joint (e.g., septic arthritis, osteomyelitis). 

Helicobacter pylori (here in H. pylon) bacteria infect the stomachs of over one-third of the world's population causing 
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stomach cancer, ulcers, and gastritis (International Agency for Research on Cancer (1994) Schistosomes, Liver Flukes 
and Helicobacter Pylori (International Agency for Research on Cancer, Lyon, France; http://www.uicc.ch/ecp/ 
ecp2904.htm). Moreover, the international Agency for Research on Cancer recently recognized a cause-and-effect 
relationship between H. pylori and gastric adenocarcinoma, classifying the bacterium as a Group I (definite) carcinogen. 
5 Preferred antimicrobial compounds of the invention found using screens provided by the invention, particularly broad- 
spectrum antibiotics, should be useful in the treatment of H. pylori infection. Such treatment should decrease the advent 
of H. py/o/7-induced cancers, such as gastrointestinal carcinoma. Such treatment should also cure gastric ulcers and 
gastritis. 

io Vaccines 

Another aspect of the invention relates to a method for inducing an immunological response in an individual, par- 
ticularly a mammal which comprises inoculating the individual with a polypeptide of the invention, or a fragment or 
variant thereof, adequate to produce antibody and/ or T cell immune response to protect said individual from infection, 

is particularly bacterial infection and most particularly Staphylococcus aureus infection. Also provided are methods where- 
by such immunological response slows bacterial replication. Yet another aspect of the invention relates to a method 
of inducing immunological response in an individual which comprises delivering to such individual a nucleic acid vector 
to direct expression of a polynucleotide or polypeptide of the invention, or a fragment or a variant thereof, for expressing 
such polynucleotide or polypeptide, or a fragment or a variant thereof in vivo in order to induce an immunological 

20 response, such as, to produce antibody and/ or T cell immune response, including, for example, cytokine-producing 
T cells or cytotoxic T cells, to protect said individual from disease, whether that disease is already established within 
the individual or not. One way of administering the gene is by accelerating it into the desired cells as a coating on 
particles or otherwise. 

Such nucleic acid vector may comprise DNA, RNA, a modified nucleic acid, or a DNA/RNA hybrid. 

25 a further aspect of the invention relates to an immunological composition which, when introduced into an individual 

capable or having induced within it an immunological response, induces an immunological response in such individual 
to a polynucleotide of the invention or protein coded therefrom, wherein the composition comprises a recombinant 
polynucleotide or protein coded therefrom comprising DNA which codes for and expresses an antigen of said polynu- 
cleotide or protein coded therefrom. The immunological response may be used therapeutically or prophylactically and 

30 may take the form of antibody immunity or cellular immunity such as that arising from CTL or CD4+ T cells. 

A polypeptide of the invention or a fragment thereof may be fused with co-protein which may not by itself produce 
antibodies, but is capable of stabilizing the first protein and producing a fused protein which will have immunogenic 
and protective properties. Thus fused recombinant protein, preferably further comprises an antigenic co-protein, such 
as lipoprotein D from Hemophilus influenzae, Glutathione-S-transf erase (GST) or beta-galactosidase, relatively large 

35 co-proteins which solubilize the protein and facilitate production and purification thereof. Moreover, the co-protein may 
act as an adjuvant in the sense of providing a generalized stimulation of the immune system. The co-protein may be 
attached to either the amino or carboxy terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and methods comprising the 
polypeptides or polynucleotides of the invention and immunostimulatory DNA sequences, such as those described in 

40 Sato, Y. et al Science 273: 352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or particular fragments thereof 
which have been shown to encode non-variable regions of bacterial cell surface proteins in DNA constructs used in 
such genetic immunization experiments in animal models of infection with Staphylococcus aureus will be particularly 
useful for identifying protein epitopes able to provoke a prophylactic or therapeutic immune response. It is believed 

45 that this approach will allow for the subsequent preparation of monoclonal antibodies of particular value from the req- 
uisite organ of the animal successfully resisting or clearing infection for the development of prophylactic agents or 
therapeutic treatments of bacterial infection, particularly Staphylococcus aureus infection, in mammals, particularly, 
humans. 

The polypeptide may be used as an antigen for vaccination of a host to produce specific antibodies which protect 
50 against invasion of bacteria, for example by blocking adherence of bacteria to damaged tissue. Examples of tissue 
damage include wounds in skin or connective tissue caused, e.g., by mechanical, chemical or thermal damage or by 
implantation of indwelling devices, or wounds in the mucous membranes, such as the mouth, mammary glands, urethra 
or vagina. 

The invention also includes a vaccine formulation which comprises an immunogenic recombinant protein of the 
55 invention together with a suitable carrier. Since the protein may be broken down in the stomach, it is preferably admin- 
istered parenterally, including, for example, administration that is subcutaneous, intramuscular, intravenous, or intra- 
dermal. Formulations suitable for parenteral administration include aqueous and non -aqueous sterile injection solutions 
which may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the bodily 
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fluid, preferably the blood, of the individual; and aqueous and non-aqueous sterile suspensions which may include 
suspending agents or thickening agents. The formulations may be presented in unit-dose or multi-dose containers, for 
example, sealed ampules and vials and may be stored in a freeze-dried condition requiring only the addition of the 
sterile liquid carrier immediately prior to use. The vaccine formulation may also include adjuvant systems for enhancing 

5 the immunogenicity of the formulation, such as oil-in water systems and other systems known in the art. The dosage 
will depend on the specific activity of the vaccine and can be readily determined by routine experimentation. 

While the invention has been described with reference to certain protein, such as, for example, those set forth in 
the Sequence Listing, it is to be understood that this covers fragments of the naturally occurring protein and similar 
proteins with additions, deletions or substitutions which do not substantially affect the immunogenic properties of the 

io recombinant protein. 

Compositions, kits and administration 

The invention also relates to compositions comprising the polynucleotide or the polypeptides discussed above or 
15 their agonists or antagonists. The polypeptides of the invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical carrier suitable for admin- 
istration to a subject. Such compositions comprise, for instance, a media additive or a therapeutically effective amount 
of a polypeptide of the invention and a pharmaceutical acceptable carrier or excipient. Such carriers may include, 
but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The formu- 
20 lation should suit the mode of administration. The invention further relates to diagnostic and pharmaceutical packs and 
kits comprising one or more containers filled with one or more of the ingredients of the aforementioned compositions 
of the invention. 

Polypeptides and other compounds of the invention may be employed alone or in conjunction with other com- 
pounds, such as therapeutic compounds. 

25 The pharmaceutical compositions may be administered in any effective, convenient manner including, for instance, 

administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or 
intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

30 Alternatively the composition may be formulated fortopical application for example in the form of ointments, creams, 

lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated dressings and sutures and aerosols, and may 
contain appropriate conventional additives, including, for example, preservatives, solvents to assist drug penetration, 
and emollients in ointments and creams. Such topical formulations may also contain compatible conventional carriers, 
for example cream or ointment bases, and ethanol or oleyl alcohol for lotions. Such carriers may constitute from about 

35 i% to about 98% by weight of the formulation; more usually they will constitute up to about 80% by weight of the 
formulation. 

For administration to mammals, and particularly humans, it is expected that the daily dosage level of the active 
agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 mg/kg. The physician in any event will determine the 
actual dosage which will be most suitable for an individual and will vary with the age, weight and response of the 
40 particular individual. The above dosages are exemplary of the average case. There can, of course, be individual in- 
stances where higher or lower dosage ranges are merited, and such are within the scope of this invention. 

In-dwelling devices include surgical implants, prosthetic devices and catheters, i.e., devices that are introduced 
to the body of an individual and remain in position for an extended time. Such devices include, for example, artificial 
joints, heart valves, pacemakers, vascular grafts, vascular catheters, cerebrospinal fluid shunts, urinary catheters, 
45 continuous ambulatory peritoneal dialysis (CAPD) catheters. 

The composition of the invention may be administered by injection to achieve a systemic effect against relevant 
bacteria shortly before insertion of an in -dwelling device. Treatment may be continued after surgery during the in-body 
time of the device. In addition, the composition could also be used to broaden perioperative cover for any surgical 
technique to prevent bacterial wound infections, especially Staphylococcus aureus wound infections. 
50 Many orthopedic surgeons consider that humans with prosthetic joints should be considered for antibiotic proph- 

ylaxis before dental treatment that could produce a bacteremia. Late deep infection is a serious complication sometimes 
leading to loss of the prosthetic joint and is accompanied by significant morbidity and mortality. It may therefore be 
possible to extend the use of the active agent as a replacement for prophylactic antibiotics in this situation. 

In addition to the therapy described above, the compositions of this invention may be used generally as a wound 
55 treatment agent to prevent adhesion of bacteria to matrix proteins exposed in wound tissue and for prophylactic use 
in dental treatment as an alternative to, or in conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling device immediately before in- 
sertion. The active agent will preferably be present at a concentration of 1 fig/ml to 1 0mg/ml for bathing of wounds or 



16 



EP 0 841 394 A2 



indwelling devices. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be employed to enhance 
the immune response. A suitable unit dose for vaccination is 0.5-5 microgram/kg of antigen, and such dose is preferably 
administered 1-3 times and with an interval of 1-3 weeks. With the indicated dose range, no adverse toxicological 
5 effects will be observed with the compounds of the invention which would preclude their administration to suitable 
individuals. 

Each reference disclosed herein is incorporated by reference herein in its entirety. Any patent application to which 
this application claims priority is also incorporated by reference herein in its entirety. 

io TABLES 

Certain pertinent data for each of the polypeptides and polynucleotides set forth in the Sequence Listing are sum- 
marized in the following Table. 

is TABLE 1 

Provided in this Table is the closest homologue of each polypeptide encoded by each ORF of the invention based 
on a comparison of the sequences of in the Sequence Listing with sequences available in the public domain (see the 
right hand column entitled "Desc"). Where no significant homologue was detected "unknown" appears in the column. 

20 Preferred polypeptides encoded by the ORFs of the invention, particularly full length proteins either obtained using 
such ORFs or encoded entirely by such ORFs, are ones that have a biological function of the homologue listed, among 
other functions. The analysis used to determine each homologue listed in Table 1 was either BlastP, BlastX or MP- 
Search, each of which is well known. In the left hand column the SEQ ID NO: of each DNA sequence in the Sequence 
Listing appears. In the center column, the SEQ ID NO: of each protein (polypeptide) sequence of each sequence in 

25 the Sequence Listing appears across from the DNA sequence which encodes it (which DNA sequence is in the left 
hand column). In some instances the DNA sequence encodes more than one protein sequence and so is listed more 
than once, each listing being shown next to the encoded protein sequence. 
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TABLE 1 





DNA 


Protein 


Assembly 


Description 


5 


No.: 


No.: 


ID 






1. 


260. 


2698808 


Unknown 




2. 


261. 


2700024 


Unknown 




3. 


262. 


2700052 


Probable ABC Transporter 


10 


4. 


263. 


2700068 


Unknown 




5. 


264. 


2700110 


Unknown 




6. 


265. 


2700242 


Poly ribo-Nucleotide Nuc leotidy l-Transferase 
(Ec 2.7.7.8) (Polynucleotide Phosphor 


15 








Ylase) (Pnpase). - Escherichia Coli. 




7. 


266. 


2700302 


Unknown 




8. 


267. 


2700520 


Unknown 




9. 


268. 


2700548 


Unknown 


20 


10. 


269. 


2700738 


Unknown 




11. 


270. 


2700754 


Unknown 




12. 


271. 


2701136 


Excinuc lease ABC Subunit B (Dina Protein) 
(Fragment). - Bacillus Subtilis. 


25 


13. 


272. 


2701728 


Unknown 




14. 


273. 


2702000 


Unknown 




15. 


274. 


2702042 


2-Oxoglutarate Dehydrogenase El 


30 








Component (Ec 1.2.4.2) (Alpha- 








Ketoglutarate De- Hydrogenase). - 
Escherichia Coli. 




16. 


275. 


2702062 


Unknown 


35 


17. 


276. 


2702076 


Unknown 


18. 


277. 


2702336 


Unknown 




19. 


278. 


2702342 


Unknown 




20. 


279. 


2702482 


Ribonuclease P Protein Component (Ec 


40 








3.1.26.5) (Protein C5) (Rnase P). - Bacill 








Us Subtilis. 




21. 


280. 


2702500 


Riboflavin Synthase Alpha Chain (Ec 
2.5.1.9). - Bacillus Subtilis. 


45 


22. 


281. 


2702520 


ECKBLTDH NCBI Gi: 41862 - Escherichia 
Coli. 




23. 


282. 


2702580 


Glucose Inhibited Division Protein B. - 
Bacillus Subtilis. 


SO 


24. 


283. 


2702648 


Unknown 
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10 



15 



25 



35 



DNA 
No.: 

25. 

26. 

27. 
28. 
29. 



30. 
31. 
32. 
33. 
34. 
34. 
35. 
36. 
37. 
38. 
39. 
40. 



41. 



50 



42. 
43. 
44. 



45. 
46. 
47. 

48. 
49. 
50. 



Protein 
No.: 

284. 

285. 

286. 
287. 
288. 



289. 
290. 
291. 
292. 
293. 
294. 
295. 
296. 
297. 
298. 
299. 
300. 



301. 



302. 
303. 
304. 



305. 
306. 
307. 

308. 
309. 
310. 



Assembly Description 
ID 

2702700 Flagellar Hook-Associated Protein 1 (Hapl). 

- Salmonella Typhimurium. 
2702838 Pyrimidine Nucleoside Transport Protein. - 

Bacillus Subtilis. 
2702934 Protein Dfp. - Escherichia Coli/ 

2703752 Unknown 

2703916 Diaminopimelate Decarboxylase (Ec 

4.1.1 .20) (Dap Decarboxylase). - Bacillus 
Sub Tiiis. 

2704026 Unknown 

2704028 Unknown 

2704448 Unknown 

2704502 Unknown 

2704680 Unknown 

2704680 Unknown 

2704892 Unknown 

2705066 Unknown 

2705204 Unknown 

2705378 Unknown 

2705414 Unknown 

2705440 Methylated-DNA-Protein-Cysteine 
Methyltransferase (Datl) Homolog - 
Haemophilus Influenzae (Strain Rd KW20) 

2705526 Ornithine Carbamoyltransferase Chain I (Ec 

2.1.3.3) (Otcase-1) (Fragment). - Sa 
Lmonella Typhimurium. 

2705632 Unknown 

2705646 Unknown 

2705712 Ribonucleoside-Diphosphate Reductase 2 

Alpha Chain (Ec 1.17.4.1) (Ribonucleotid E 
Reductase 2). - Salmonella Typhimurium. 

2705740 Unknown 

2705742 Unknown 

2705876 D-Lactate Dehydrogenase (Ec 1. 1 . 1 .28) (D- 

Ldh). - Lactobacillus Plantarum. 
2706108 Unknown 
2706282 Unknown 
2706344 Unknown 



55 
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10 



15 



20 



25 



30 



40 



45 



50 



DNA 
No.: 
51. 

52. 

53. 
54. 

55. 



56. 
57. 
58. 



59. 
60. 
61. 
62. 
63. 

64. 
65. 
66. 
67. 
68. 
69. 
70. 
70. 
71. 
72. 
72. 
73. 
74. 
75. 
76. 



Protein 

No.: 
311. 

312. 

313. 
314. 

315. 



316. 
317. 
318. 



319. 
320. 
321, 
322. 
323. 

324. 
325. 
326. 
327. 
328. 
329. 
330. 
331. 
332. 
333. 
334. 
335. 
336. 
337. 
338. 



Assembly Description 
ID 

2706452 Nickel-Binding Periplasmic Protein 

Precursor. - Escherichia Coli. 
2706548 Stringent Response-Like Protein - 

Streptococcus Equisimilis 
2706730 Unknown 

2706734 Phosphate Regulatory Protein - Rhizobium 

Meliloti 

2706832 Diphtheria Toxin Repressor (Iron-Dependent 

Diphtheria Tox Regulatory Element) (Tox 
Regulatory Factor). - Corynebacterium 
Diphtheriae. 

2706930 Unknown 

2707190 Unknown 

2707592 Phosphoribosylaminoimidazole Carboxylase 

Catalytic Subunit (Ec 4. 1 . 1 .2 1) (Air C 
Arboxylase) (Aire). - Bacillus Subtilis. 

2707978 Unknown 

2708140 Unknown 

2708372 Unknown 

2708478 Unknown 

2708862 Nitrite Reductase (Nad(P)H) (Ec 1 .6.6.4). - 

Escherichia Coli. 
2709034 Unknown 
2709098 Unknown 
2709168 Unknown 
2709352 Unknown 

2709366 Rep827 Protein - Staphylococcus Sp. 

2709474 Unknown 

2709570 Unknown 

2709570 Unknown 

2709600 Unknown 

2709642 Unknown 

2709642 Unknown 

2709736 Unknown 

2709760 Unknown 

2709766 Unknown 

2709772 Unknown 
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15 



25 



30 



40 



45 



55 



DNA 
No.: 

77. 
78. 



79. 
80. 
81. 
82. 
83. 
84. 
85. 

86. 

87. 

88. 
89. 
90. 
91. 
92. 
93. 
94. 
95. 

96. 
96. 
97. 



98. 
99. 

100. 
101. 
102. 
103. 
104. 
105. 



Protein Assembly Description 

No.: ID 

339. 2709790 Unknown 

340. 2709830 Udp-N-AcetylmuramoylaIanine~D- 

Glutamate Ligase (Ec 6.3.2.9) (Udp-N- 
Acetylmura Noyl-L-Alanyl-D-Glutamate 
Synthetase). - Bacillus Subtilis. 

341. 2709848 Unknown 

342. 2709852 Unknown 

343. 2709888 Unknown 

344. 2709954 Unknown 

345. 2709992 Unknown 

346. 2709994 Heat Shock Protein Hslu. - Bacillus Subtilis. 

347. 2710014 Tryptophan Synthase (Ec 4.2.1.20). - 

Neurospora Crassa. 

348. 27 10028 Single-Strand Binding Protein (Ssb) (Helix- 

Destabilizing Protein). - Bacillus S Ubtilis. 

349. 2710092 Elongation Factor Tu (Ef-Tu). - Bacillus 

Subtilis. 

350. 2710096 Unknown 

351. 2710272 Unknown 

352. 2710276 Unknown 

353. 2710330 Unknown 

354. 2943530 Unknown 

355. 2943556 Unknown 

356. 2943564. Unknown 

357. 2943624 Possible Sodium-Dependent Phosphate 

Transporter 

358. 2943704 Unknown 

359. 2943704 Unknown 

360. 2943710 Anaerobic Ribonucleoside-Triphosphate 

Reductase (Ec 1.17.4.2). - Escherichia Co 
Li. 

361. 2943716 Unknown 

362. 2943742 U00013 NCBI Gi: 466868NCBI Gi: - 

Mycobacterium Leprae. 

363. 2943746 Unknown 

364. 2943774 Unknown 

365. 2943782 Unknown 

366. 2943806 Unknown 

367. 2943822 Unknown 

368. 2943834 Unknown 
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DNA 


Protein 


Assembly 


Description 




No.: 


No.: 


ID 




5 


106. 


369. 


2943864 


Unknown 




107. 


370. 


2943956 


Unknown 




108. 


371. 


2943960 


Unknown 




108. 


372. 


2943960 


Unknown 


10 


109. 


373. 


2944036 


Deoxyribose-Phosphate Aldolase (EC 










4. 1 .2.4) - Mycoplasma Hominis (SGC3) 




110. 


374. 


2944044 


Unknown 


15 


110. 


375. 


2944044 


Biotin Synthetase (Ec 2.8.1.-). - Bacillus 








Sphaericus. 




111. 


376. 


2944066 


Unknown 




112. 


377. 


2944114 


Unknown 




113. 


378. 


2944126 


Sirohem Synthase (Contains: Uroporphyrin- 


20 








Iii C-Methyitransferase (Ec 2.1.1.107) 










(Urogen Iii Methylase) (Sumt) 










(Uroporphyrinogen Iii Methylase) / 


25 








Precorrin-2 Oxidase (Ec I.-.-.-) / 










Ferrochelatase (Ec 4.99.1.-)). - Escherichia 










Coli. 


30 


114. 


379. 


2944146 


Mercuric Reductase (Ec 1.16.1.1) (Hg(Ii) 








Reductase). - Staphylococcus Aureus. 




115. 


380. 


2944210 


Unknown 




116. 


381. 


2944212 


Unknown 


35 


117. 


382. 


2944262 


Regulatory Protein Pfor - Clostridium 










Perfringens 




118. 


383. 


2944264 


Replicative Dna Helicase (Ec 3.6.1.-). - 










Bacillus Subtilis. 


40 


119. 


384. 


2944276 


Ribokinase (Rbsk) Homolog - Haemophilus 










Influenzae (Strain Rd KW20) 




120. 


385. 


2944306 


Dna-Invertase Hin. - Salmonella 










Typhimurium. 


45 


121. 


386. 


2944308 


Unknown 




122. 


387. 


2944310 


Histidine Ammonia-Lyase (Ec 4.3.1.3) 










(Histidase). - Bacillus Subtilis. 




123. 


388. 


2944342 


Unknown 


50 


124. 


389. 


2944344 


Primosomal Protein N' (Replication Factor 










Y). - Escherichia Coli. 




125. 


390. 


2944360 


Unknown 




126. 


391. 


2944364 


Unknown 
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50 



DNA 
No.: 

127. 

128. 
128. 

129. 



130. 
131. 
132. 
133. 
134. 

135. 



136. 
137. 

138. 



139. 



140. 



141. 
142 

143. 



144. 



Protein 
No.: 

392. 

393. 
394. 

395. 



396. 
397. 
398. 
399. 
400. 



401. 



402. 
403. 

404. 



405. 



406. 



407. 
408. 

409. 



410. 



Assembly Description 
ID 

2944366 Starvation Sensing Protein Rspb (Ec 1.1.1.). 

- Escherichia Coli. 
2944390 Probable ABC Transporter 

2944390 Atp-Binding Protein Abe. - Escherichia 

Coli. 

2944414 Potassium-Transporting Atpase (Ec 

3.6.1.36), A Chain (Atp Phosphohydrolase 
(Pot Assium-Transporting), A Chain). - 
Escherichia Coli. 

2944426 Unknown 

2944434 Unknown 

2944446 Unknown 

2944452 Unknown 

2944464 Lipoprotein Nlpd Precursor. - Escherichia 

Coli. 

2944492 Pts System, Galacticol-Specific Iia 

Component (Eiia-Gat) (Galacticol- Permease 
Iia Component) (Phosphotransferase 
Enzyme Ii, A Component) (Ec 2.7.1.69). - E 
Scherichia Coli. 

2944508 Unknown 

2944522 1 -Phosphofructokinase (Fruk) Homolog - 

Haemophilus Influenzae (Strain Rd KW20) 

2944556 Folylpolyglutamate Synthase (Ec 6.3.2. 17) 
(Folylpoly-Gamma-Glutamate Synthetase) 
(Fpgs). - Bacillus Subtilis. 

2944576 Lactam Utilization Protein (Lamb) Homolog 

- Haemophilus Influenzae (Strain Rd K 
W20) 

2944578 Lactam Utilization Protein (Lamb) Homolog 

- Haemophilus Influenzae (Strain Rd K 
W20) 

2944584 Possible Glutathione Peroxidase 

2944592 BK5TATTP NCB - Bacteriophage BK5-T 

DNA. 

2944644 Phosphoribosylformylglycinamidine Cyclo- 

Ligase (Ec 6.3.3.1) (Airs) (Phosphoribo 
Syl-Aminoimidazole Synthetase) (Air 
Synthase). - Bacillus Subtilis. 

2944648 Unknown 
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10 



20 



30 



DNA 

No.: 
145. 
146. 



147. 
148. 
149. 
150. 
151. 
152. 
153. 



50 



154. 

155. 
156. 
157. 



158. 



159. 
159. 
160. 
161. 
162. 
163. 
164. 

165. 
166. 
167. 
168. 



Protein 

No.: 
411. 
412. 



413. 
414. 
415. 
416. 
417. 
418. 
419. 



420. 

421. 
422. 
423. 



424. 



425. 
426. 
427. 
428. 
429. 
430. 
431. 

432. 
433. 
434. 
435. 



Assembly Description 

ro 

2944654 Unknown 

2944658 Indole-3-Glycerol Phosphate Synthase (Ec 

4. 1 . 1 .48) (Igps). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 

2944670 Unknown 

2944694 Unknown 

2944706 Unknown 

2944720 Unknown 

2944746 Unknown 

2944760 Unknown 

2944782 Pts System, Sucrose- Specific Iiabc 

Component (Eiiabc-Scr) (Sucrose- Permease 
Ii Abe Component) (Phosphotransferase 
Enzyme Ii, Abe Component) (Ec 2.7.1.69) 
(E Ii-Scr). - Streptococcus Mutans. 

2944800 Right Origin-Binding Protein. - Escherichia 

Coli. 

2944806 Unknown 
2944808 Unknown 

2944838 Phospho-N-Acetylmuramoyl-Pentapeptide- 
Transferase (Ec 2.7.8.13). - Bacillus Sub 
Tilis. 

2944882 Oligopeptide Transport Atp-Binding Protein 

Oppf. - Lactococcus Lactis (Subsp. L Actis) 
(Streptococcus Lactis). 

2944888 Unknown 

2944888 Unknown 

2944902 Unknown 

2944930 Unknown 

2944932 Unknown 

2944936 Unknown 

2944966 30s Ribosomal Protein S4 (Bs4). - Bacillus 

Subtilis. 
2944988 Unknown 
2945022 Unknown 
2945040 Unknown 
2945060 Unknown 
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45 



DNA 

No.: 
169. 
170. 



171. 
172. 
173. 
174. 
175. 
176. 
177. 



178. 

179. 
180. 
181. 
182. 

183. 
184. 

184. 

185. 

186. 
187. 

188. 

189. 
190. 
191. 
192. 

193. 



Protein 
No.: 
436. 
437. 



438. 
439. 
440. 
441. 
442. 
443. 
444. 



445. 

446. 
447. 
448. 
449. 

450. 
451. 

452. 

453. 

454. 
455. 

456. 

457. 
458. 
459. 
460. 

461. 
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Assembly Description 
ID 

2945070 Unknown 

2945078 Isocitrate Dehydrogenase (Nadp) (Ec 

l . 1 . 1 .42) (Oxalosuccinate Decarboxylase) (I 
Dh) (Nadp+-Specific ledh) (Idp). - Bacillus 
Subtilis. 

2945082 Unknown 

2945186 Unknown 

2945250 Unknown 

2945254 Unknown 

2945260 Unknown 

29453 16 Tagd Protein. - Vibrio Cholerae. 

2945338 4-Methyl-5-(Beta-Hydroxyethyl)Thiazole 

Monophosphate Synthesis Protein Thif - E 

Scherichia Coli 
2945378 Pyrroline-5-Carboxylate Reductase (EC 

1.5.1.2) - Arabidopsis Thaliana 
2945474 Unknown 
2945520 Unknown 
2945550 Unknown 

2945580 Nadh Dehydrogenase I Chain G (Ec 1 .6.5.3). 

- Escherichia Coli. 
2945772 Unknown 

2945780 Signal Peptidase I S (Ec 3.4.2 1 .89) (Spase I) 

(Leader Peptidase I). - Bacillus Subtilis. 

2945780 Signal Peptidase I S (Ec 3.4.21.89) (Spase I) 
(Leader Peptidase I)- - Bacillus Subtilis. 

2945792 Fibronectin-Binding Protein Precursor 

(Fnbp). - Staphylococcus Aureus. 

2945794 Unknown 

2945802 Rod Shape-Determining Protein. - 

Escherichia Coli. 
3038342 X-Pro Dipeptidase (EC 3.4. 1 3.9) - 

Lactobacillus Delbrueckii 
3038352 Unknown 
3038362 Unknown 
3038372 Unknown 

3038374 Phosphotransferase System Enzyme II - 

Staphylococcus Carnosus 
3038406 Unknown 



55 
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Assembly Description 
ID 

30384 1 0 Crtd Protein - Rhodobacter Sphaeroides 

3038424 Tetrahydropteroyltriglutamate 

Methyltransferase (Mete) Homolog - 
Haemophilus In Fluenzae (Strain Rd 
KW20) 

3038426 Unknown 

3038428 Exopolysaccharide Production Protein Pss. - 

Rhizobium Leguminosarum (Biovar Pha 
Seoli). 

3038438 Galacticol- 1 -Phosphate Dehydrogenase (Ec 

1.1.1 .-.). - Escherichia Coli. 
3038442 Unknown 

3038450 Excinuclease Abe Subunit C. - Bacillus 

Subtilis. 

3038458 Nitrite Reductase (Nad(P)H) Small Subunit 

(Ec 1.6.6.4). - Bacillus Subtilis. 
3038474 Unknown 

3038482 Probable Imidazoleglycerol-Phosphate 

Dehydratase (Ec 4.2.1.19). - Anabaena Sp. 
(Strain Pec 7120). 

3038492 Unknown 

3038496 SXSCRBA Sucrose Repressor - 
Staphylococcus Xylosus. 

3038498 Nitrate Reductase Alpha Chain - Bacillus 

Subtilis (Fragment) 

3038498 Nitrate Reductase Alpha Chain - Bacillus 

Subtilis (Fragment) 

3038504 3-Isopropy Imalate Dehydratase (Ec 4.2. 1 .33) 

(Isopropylmalate Isomerase) (Alpha- Ipm 
Isomerase) (Ipmi). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus La Ctis). 

3038504 3-lsopropylmalate Dehydratase (Ec 4.2. 1 .33) 

(Isopropylmalate Isomerase) (Alpha- Ipm 
Isomerase) (Ipmi). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus La Ctis). 

3038510 Unknown 

3038536 Unknown 

3038538 Unknown 
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DNA 
No.: 
211. 

212. 

212. 

213. 
214. 
215. 
216. 



217. 



218. 

219. 

220. 
221. 
222. 
223. 
224. 
225. 
226. 
227. 
228. 

229. 
230. 

231. 
232. 
232. 
233. 
234. 

235. 
236. 



Protein 
No.: 
481. 

482. 

483. 

484. 
485. 
486. 
487. 



488. 



489. 

490. 

491. 
492. 
493. 
494. 
495. 
496. 
497. 
498. 
499. 

500. 
501. 

502. 
503. 
504. 
505. 
506. 

507. 
508. 



Assembly Description 
ID 

3038544 Formate Dehydrogenase Alpha Chain (Ec 

1.2.1.2). - Methanobacterium Formicicum. 

3038550 Peptide Transport System Permease Protein 

Sapb. - Salmonella Typhimuriiim. 

3038550 Dipeptide Transport System Permease 

Protein Dppb. - Bacillus Subtilis. 

3038552 Unknown 

3038578 Unknown 

3038588 Seca Protein - Staphylococcus Carnosus 

3038590 Di-Tripeptide Transporter. - Lactococcus 

Lactis (Subsp. Lactis) (Streptococcus 

Lactis). 

3038594 Histidinol Dehydrogenase (Ec 1 . 1 . 1 .23) 

(Hdh). - Lactococcus Lactis (Subsp. Lact Is) 

(Streptococcus Lactis). 
3038596 Histidinol Dehydrogenase (Ec 1 . 1 . 1 .23) 

(Hdh). - Escherichia Coli. 
3038624 Thiolase (EC 2.3.1.9) - Clostridium 

Acetobutylicum 
3038642 Exou Protein - Rhizobium Meliloti 

3038658 Unknown 
3038686 Unknown 
3038692 Unknown 
3038726 Unknown 
3038746 Unknown 
3038752 Unknown 
3038764 Unknown 

3038766 Glucokinase Regulator Homolog - 

Haemophilus Influenzae (Strain Rd KW20) 
3038772 Unknown 

3038798 Bile Acid-Coenzyme A Ligase (Ec 6.-.-.-). - 

Eubacterium Sp. (Strain Vpi 1 2708). 
30388 10 Probable Reductase 

3038816 Unknown 
3038816 Unknown 
3038848 Unknown 

3038858 3-Dehydroquinate Synthase (Ec 4.6. 1 .3). - 

Bacillus Subtilis. 
3038872 Unknown 
3038896 Unknown 
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DNA 


Protein 


Assembly 


Description 


5 


No.: 


No.: 


ID 






237. 


509. 


3038902 


Unknown 




238. 


510. 


3038954 


Unknown 




239. 


511. 


3038956 


Unknown 


10 


240. 


512. 


3038964 


PI 15 Protein - Mycoplasma Hyorhinis 
(SGC3) 




241. 


513. 


3039072 


Pyruvate Oxidase (EC 1.2.3.3) Mutant With 
Pro 178 Replaced By Ser, Ser 188 Rep] 


15 








Aced By Asn, And Ala 458 Replaced By 
Val (P178s,S188n,A458v), Chain A - Lacto 
Bacillus Plantarum 




242. 


514. 


3039118 


Unknown 


20 


243. 


515. 


3039122 


Unknown 




243. 


516. 


3039122 


Unknown 




244. 


517. 


3039124 


Unknown 




245. 


518. 


3039126 


Unknown 


25 


246. 


519. 


3039132 


Glucose-Fructose Oxidoreductase (EC 1.1.- 
.-) Precursor - Zymomonas Mobilis 




246. 


520. 


3039132 


Unknown 


30 


247. 


521. 


3039156 


Putative Ornithine Carbamoyl transferase (Ec 








2.1.3.3) (Otc) (Hypothetical Protein In Sagp 
3'region) (Fragment). - Streptococcus 
Pyogenes. 


35 


248. 


522. 


3039170 


Unknown 


249. 


523. 


3039172 


Carbarnoyl-Phosphate Synthase, Pyrimidine- 
Specific, Large Chain (Ec 6.3.5.5) (Ca 
Rbamoyl-Phosphate Synthetase Ammonia 


40 








Chain). - Bacillus Subtilis. 


250. 


524. 


3039176 


Acetoin(Diacetyl) Reductase (Ec 1 . 1 . 1 .5) 
(Acetoin Dehydrogenase) (Ar). - Klebsi Ella 
Terrigena. 


45 


251. 


525. 


3039182 


Formate Dehydrogenase Alpha Chain (Ec 
1.2.1.2). - Methanobacterium Formicicum. 




252. 


526. 


3039200 


Uvr-402 Protein - Streptococcus 
Pneumoniae Plasmid Psb470 


50 


253. 


527. 


3039234 


Unknown 




254. 


528. 


3039244 


CDP-RibitoI Pyrophosphorylase - 
Haemophilus Influenzae 




255. 


529. 


3039246 


Unknown 


55 


256. 


530. 


3039248 


Unknown 
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DNA 


Protein 


Assembly 




No.: 


No.: 


ED 




257. 


531. 


3039254 


Unknown 


257. 


532. 


3039254 


Unknown 


258. 


533. 


3039264 


Unknown 


259. 


534. 


3039282 


Unknown 



Description 



10 



15 



Example 1 

Isolation of DNA coding for a Novel Protein from S. Aureus WCUH 29 

The polynucleotide having the DNA sequence given herein can be obtained from a library of clones of chromosomal 
DNA of S. aureus WCUH 29 in E.coli. Libraries may be prepared by routine methods, for example: 



Methods 1 and 2. 

Total cellular DNA is isolated from Staphylococcus aureus strain WCUH29 (NCI MB 40771 ) according to standard 
procedures and size-fractionated by either of two methods. 



Method 1. 

25 

Total cellular DNA is mechanically sheared by passage through a needle in order to size-fractionate according to 
standard procedures. DNA fragments of up to 1 Ikbp in size are rendered blunt by treatment with exonuclease and 
DNA polymerase, and ScoR\ linkers added. Fragments are ligated into the vector Lambda Zapll that has been cut with 
EcoRI, the library packaged by standard procedures and E.coli infected with the packaged library. The library is am- 
30 pirfied by standard procedures. 



Method 2. 



Total cellular DNA is partially hydrolysed with a combination of four restriction enzymes (Rsa\, Pali, Alu\ and 
35 BsM 235I) and size -fractionated according to standard procedures. EcoRI linkers are ligated to the DNA and the frag- 
ments then ligated into the vector Lambda Zapll that have been cut with EcoRI, the library packaged by standard 
procedures, and E. cofi infected with the packaged library. The library is amplified by standard procedures. 

Example 2 

40 

The determination of expression during infection of a gene from Staphylococcus aureus WCUH 29 

Necrotic fatty tissue from a four day groin infection of Staphylococcus aureus WCUH29 in the mouse is efficiently 
disrupted and processed in the presence of chaotropic agents and RNAase inhibitor to provide a mixture of animal 
45 and bacterial RNA. The optimal conditions for disruption and processing to give stable preparations and high yields of 
bacterial RNA are followed by the use of hybridisation to a radiolabeled oligonucleotide specific to Staphylococcus 
aureus 16S RNA on Northern blots. The RNase free, DNase free, DNA and protein free preparations of RNA obtained 
are suitable for Reverse Transcription PCR (RT-PCR) using unique primer pairs designed from the sequence of each 
gene of Staphylococcus aureus WCUH29. 

50 

a) isolation of tissue infected with Staphylococcal aureus WCUH29 from a mouse animal model of Infection 



10 ml. volumes of sterile nutrient broth (No. 2 Oxoid) are seeded with isolated, individual colonies of Staphylococcus 
aureus WCUH29 from an agar culture plate. 
55 The cultures are incubated aerobically (static culture) at 37 degrees C for 1 6-20 hours . 4 week old mice (female, 

18g-22g, strain MFI) are each infected by subcutaneous injection of 0.5ml. of this broth culture of Staphylococcus 
aureus WCUH29 (diluted in broth to approximately 108 cfu/ml.) into the anterior , right lower quadrant (groin area). 
Mice should be monitored regularly during the first 24 hours after infection, then daily until termination of study Animals 
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with signs of systemic intection, i.e. lethargy, ruffled appearance, isolation from group, should be monitored closely 

and if signs progress to moribundancy, the animal should be culled immediately. 

Visible external signs of lesion development will be seen 24-48h after infection. Examination of the abdomen of 

the animal will show the raised outline of the abscess beneath the skin. The localised lesion should remain in the right 
s lower quadrant, but may occasionally spread to the left lower quadrant, and superiorly to the thorax. On occasions, 

the abscess may rupture through the overlying skin layers. The affected animal should be culled immediately and the 

tissues sampled if possible. Failure to cull the animal may result in the necrotic skin tissue overlying the abscess being 

sloughed off, exposing the abdominal muscle wall. 

Approximately 96h after infection, animals are killed using carbon dioxide asphyxiation. To minimise delay between 
10 death and tissue processing /storage, mice should be killed individually rather than in groups.The dead animal is placed 

onto its back and the fur swabbed liberally with 70% alcohol. An initial incision using scissors is made through the skin 

of the abdominal left lower quadrant, travelling superiorly up to, then across the thorax. The incision is completed by 

cutting interiorly to the abdominal lower right quadrant. Care should be taken not to penetrate the abdominal wall. 

Holding the skin flap with forceps, the skin is gently pulled way from the abdomen. The exposed abscess, which covers 
is the peritoneal wall but generally does not penetrate the muscle sheet completely, is excised, taking care not to puncture 

the viscera. 

The abscess/muscle sheet and other infected tissue, such as the necrotic pads of fatty tissue in the abdominal 
lower right and left quadrants may require cutting in sections, prior to flash-freezing in liquid nitrogen, thereby allowing 
easier storage in plastic collecting vials. 

20 

b) Isolation of Staphylococcus aureus WCUH29 RNA from infected tissue samples 

4-6 infected tissue samples (each approx 0.5-0.7g) in 2ml screw-cap tubes are removed from -80° C. storage into 
a dry ice ethanol bath In a microbiological safety cabinet the samples are disrupted individually whilst the remaining 

25 samples are kept cold in the dry ice ethanol bath. To disrupt the bacteria within the tissue sample 1 ml of TRlzol Reagent 
(Gibco BRL, Life Technologies) is added followed by enough 0.1mm zirconia/silica beads to almost fill the tube.the lid 
is replaced taking care not to get any beads into the screw thread so as to ensure a good seal and eliminate aerosol 
generation. The sample is then homogenised in a Mini-Bead Beater Type BX-4 (Biospec Products) Necrotic fatty tissue 
is treated for 100 seconds at 5000 rpm in order to achieve bacterial lysis. In vivo grown bacteria require longer treatment 

30 than in vitro grown S.aureus WCUH29 which are disrupted by a 30 second bead-beat. 

After bead-beating the tubes are chilled on ice before opening in a fume-hood as heat generated during disruption 
may degrade the TRlzol and release cyanide. 

200 microlitres of chloroform is then added and the tubes shaken by hand for 15 seconds to ensure complete 
mixing. After 2-3 minutes at room temperature the tubes are spun down at 12,000 x g, 4 °C for 15minutes and RNA 

35 extraction is then continued according to the method given by the manufacturers of TRlzol Reagent i.e.: - The aqueous 
phase, approx 0.6 ml, is transferred to a sterile eppendorf tube and 0.5 ml of isopropanol is added. After 10 minutes 
at room temperature the samples are spun at 1 2,000 x g, 4°C for 1 0 minutes. The supernatant is removed and discarded 
then the RNA pellet is washed with I ml 75% ethanol. A brief vortex is used to mix the sample before centrif uging at 
7,500 x g, 4 "C for 5 minutes. The ethanol is removed and the RNA pellet dried under vacuum for no more than 5 

40 minutes. Samples are then resuspended by repeated pipetting in 100 microlitres of DEPC treated water, followed by 
5-10 minutes at 55 "C .Finally, after at least 1 minute on ice, 200 units of Rnasin (Promega) is added 
RNA preparations are stored at -80 °C for up to one month. For longer term storage the RNA precipitate can be stored 
at the wash stage of the protocol in 75% ethanol for at least one year at -20"C. 

Quality of the RNA isolated is assessed by running samples on 1% agarose gels. 1 x TBE gels stained with ethidium 

45 bromide are used to visualise total RNA yields. To demonstrate the isolation of bacterial RNA from the infected tissue 
1 x MOPS, 2.2M formaldehyde gels are run and vacuum blotted to Hybond-N (Amersham). The blot is then hybridised 
with a 32 P labelled oligonucleotide probe specific to 16s rRNA of S.aureus ( K. Greisen, et a!., J. Clin. Microbial . 32 
335-351 (1994)). An oligonucleotide selected from the group consisting of the polynucleotides of the Sequence Listing 
is used as a probe. The size of the hybridising band is compared to that of control RNA isolated from in vitro grown S. 

50 aureus WCUH29. Correct sized bacterial 1 6s rRNA bands can be detected in total RNA samples which show extensive 
degradation of the mammalian RNA when visualised on TBE gels. 

c) The removal of DNA from Staphylococcus aureus WCUH29 derived RNA 

55 DNA was removed from 73 microlitre samples of RNA by a 15 minute treatment on ice with 3 units of DNasel, 

amplification grade (Gibco BRL, Life Technologies) in the buffer supplied with the addition of 200 units of Rnasin (Prome- 
ga) in a final volume of 90 microlitres. 

The DNase was inactivated and removed by treatment with TRlzol LS Reagent (Gibco BRL, Life Technologies) 
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according to the manufacturers protocol. 

DNase treated RNA was resuspended in 73 microlitres ot DEPC treated water with the addition of Rnasin as described 
in Method 1 . 

5 d) The preparation of cDNA from RNA samples derived from infected tissue 

10 microlitre samples of DNase treated RNA are reverse transcribed using a Superscript Preamplification System 
for First Strand cDNA Synthesis kit (Gibco BRL, Life Technologies) according to the manufacturers instructions. 1 
nanogram of random hexamers is used to prime each reaction. Controls without the addition of SuperScriptll reverse 
io transcriptase are also run. Both +/-RT samples are treated with RNaseH before proceeding to the PCR reaction 

e) The use of PCR to determine the presence of a bacterial cDNA species 

PCR reactions are set up on ice in 0.2ml tubes by adding the following components: 

is 

45 microlitres PCR SUPERMIX (Gibco BRL, Life Technologies). 
1 microlitre 50mM MgCI 2 , to adjust final concentration to 2.5mM. 

1 microlitre PCR primers(optimally 18-25 basepairs designed to possess similar annealing temperatures), each 
primer at lOmM initial concentration. 
20 2 microlitres cDNA. 

PCR reactions are run on a Perkin Elmer GeneAmp PCR System 9600 as follows: 

5 minutes at 95°C, then 50 cycles of 30 seconds each at 94°C, 42°C and 72 °C followed by 3 minutes at 72°C 
25 and then a hold temperature of 4°C. 

10 microlitre aliquots are then run out on 1% 1 x TBE gels stained with ethidium bromide with PCR product sizes 
estimated by comparison to a 100 bp DNA Ladder (Gibco BRL, Life Technologies). 

RT/PCR controls may include ■+-/- reverse transcriptase reactions, 16s rRNA primers or DNA specific primer pairs 
30 designed to produce PCR products from non-transcribed S.aureus WCUH29 genomic sequences. 

To test the efficiency of the primer pairs they are used in DNA PCR with WCUH29 total DNA. PCR reactions are 
set up and run as described above using approx. 1 microgram of DNA in place of the cDNA and 35 cycles of PCR 
rather than 50. 

Primer pairs which fail to give the predicted sized product in either DNA PCR or RT/PCR (approx 20%) are PCR 
35 failures and as such are uninformative. Of those which give the correct size product with DNA PCR three classes are 
distinguished in RT/PCR: 

1 . Genes which are not expressed in vivo reproducibly fail to give a product in RT/PCR. 

2. Genes which are expressed in vivo reproducibly give the correct size product in RT/PCR and show a stronger 
40 signal in the +RT samples than in the -RT controls. 

3. Genes which may be expressed in vivo give similar amounts of product in both +/-RT samples. 



45 



50 



55 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Black, Michael 
Burn ham, Martin 
Hodgson, John 
Knowles , David 
Lonetto, Michael 
Nicholas, Richard 
Pratt, Julie 
Reichard, Richard 
Rosenberg, Martin 
Ward, Judith 

(ii) TITLE OF THE INVENTION: Novel Prokaryotic Polynucleotides, 

Polypeptides and Their Uses 

(iii) NUMBER OF SEQUENCES: 534 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/027,032 

(B) FILING DATE: 24-SEP-1996 
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(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Gimmi , Edward R 

(B) REGISTRATION NUMBER: 38,891 

(C) REFERENCE /DOCKET NUMBER: P50549 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX : 

(2) INFORMATION FOR SEQ ID NO:l; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAATTCCTGA TAATTATTTN GCGATATAAG TCGTCAACGT ACCAATATTA TCAATAGTCA 
CTTTANATNC ATCACCTGGT TGGAAAAATT TAGGTGGTTG AATACCTGNA CCANCGTCTG 
AGTGGTGTAC CAGTTGCAAT AATATCTCCC GGATGNAGTG CAACACATCA TGACATCACT 
ACTATTACAA CAACATATAT CAGAATCAAG TCGCCAGNGT NACCATCTTG TCTAATTTCA 
TNGTGACCTC NTGTAACAAT ATTCACATTT CCAGGTAATG GCAGTNCGTC TTGAGNAANG 
ATTTAAGGAC CCATNGGNTN ANCCNNCAGT TAAACTNNGT GGTAAAAATG CTTGATCTTG 
NTCACTGCGG GCTNTGAGAT CAAGTGATAT CGTAAA 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCCCTCTATT GCTTNNTTGA TACGTTTCTG TTCTTTAAAT CCAAGATATA CCGCAAACAA 
CATAACAAGA ATGAAACCTA TTATTTGAAT AGGTATTAAT CCATACCATA ATTCATTGAC 
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10 



15 



ACTTTTGGCT TTTAACACTG CAGCTACACG AGCCATTGGA CCTCCCCAAG GTNCCATGTT 180 

CATAATCGCC GCGCTTAATG CTAATAGTAA AATCAATAAA TATTTATTCA TATTTAACGC 240 

TTTATATAAA GGTAATAATG CAGGAATAGA AAGCAAAAAT GTTACCGCAC CGGCCCCATC 3 00 

TAATTGGGCT ATTGTGCCAA TTAAAGCTGT CATTGCACAG ACAATGACGA CATTGCCTCG 3 60 

TGTCATTAAT ATTAAGCGTT TGACAAGCGG C TTG AAT AAA CCACTATCGT TCATGATGCC 420 

AAAGAAAATA ATGGCANAGA TAAACATAAT AACAACGTNG ATGACNTGAT CTAACCCTTT 4 80 

AGCAAAAAAT CCAACCAAAT CTGNC AC AC T ATATCCTAAA ATCATTGCCC CTAAGCAAGG 540 

TATGATTGTC ATACCTACAN CTGGATTTAT CTTNNTGGGC AATGAG TAAN CCTACAACTG 600 

AAATAATAAT NATGNGCCCC ATCACTGTCA ACCACATATT ATC AC TATTC ATTTATTTTC 66 0 
TCCCTCCACT TNCAATACAA TTTACATTCA CCCCCC 



(2) INFORMATION FOR SEQ ID NO : 3 : 



696 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 588 base pairs 

20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

30 

CCCTTGNCAA TTCATNGGGT TTTCTGTTTA ATATTTTTCG GGTTCTAANC CCACCATATC 60 

AAGTGATTCA TCTACTCTTT TATCAATATC TTTTTCTTCC CACTTTTTCA TTTGTGCCAC 120 

TTTGTGCAAA AATTTCCTTT GGGCCTCAAA TGGGGNATAA TTCCAATCTN CCTCCAANAC 180 

35 GTANCCAATA TCCCAACGCA TTTCGTAATA CTGGATAATC ACTTATTGGT TTATCTTTAA 240 

AATAAATATA ACCTTCACTT AAGTGAATGA GTCGATTAAT CATTTTTAAT GTCGTAGTTT 3 00 

TTCCACAACC TGAAGGTCCA ATT AGO AC AA AAAATTCACC CTCATTAATA TTGAAACTAA 360 

TGTTATCGAC AGCAACATGT TTGCCATAAC GC TTAGTT AC ATTTTTAAAC TTAATCACTT 420 

40 TGCCACCTCT TTTTTTC TC A TAGCATAAAA CCGAGATTAT ATGTATGTAT TC CCTATTT A 480 

ACCACGTTTA TTACAATTTT CAAATTTAAA TGATTTATCC TTGAACTTTT TTAACAAAAT 540 

AATGAATAAT AGGTAATC TC CAGTTAAGAA ATAGTGTTAT TTTACCTT 588 

45 (2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



5 



10 



CGGGCAGNCT 


TTANAATTAT 


TTNAATGATG 


ATTAGCAATG 


CATATACGAT 


TAATTCGATT 


60 


ATGGNTTATA 


TCTNNATTGA 


TTAANTCACT 


TATATTTCAC 


AAGTGTGTTG 


CTTTGGAGCG 


120 


TCAACGATGC 


TATCACATAT 


TACGGCATAT 


GAAATTTTAT 


CTGAAATTCG 


NAAAAAATTA 


180 


GCTCAAAAAT 


TAATGCGCCT 


CCCTTTAGGN 


GTAGTGGAAT 


CTAAGAAAAT 


AGGTGAATTA 


240 


AAAAATATAT 


TTGNCGATAA 


GGTTGAAACA 


ATAGAATTAC 


CTTTAGCACA 


TATGATTCCT 


300 


GAAGTTATNG 


GAAACTTACT 


TGTGGCAGCT 


GCTATCTTCT 


TATACATAAT 


GNTCATNGAT 


360 


TGGCGTATTG 


GATAGGTGNC 


TTANTAGATA 


ACGATACCAA 


TNTCAATTTN 


CGCTTC TAAA 


420 


AAAGTNATGT 


CTTGGATTTT 


AATGAGANAA 


TTTGCTGGAC 


AAANTGAAAT 


CCAATTTNTN 


480 


TTTTTNATTT 


GNCGNAATGG 










500 



(2) INFORMATION FOR SEQ ID NO:5: 



25 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ACCCTGTTGT CCCCNGGGGT CAATAGACCC CGACATACTT TNNGGCTTCT GAAAATAAAA 60 

TTTCAGGNNC GCACTTTAAA GATGGACGTC GATGTCGACT AATGTGATGA CNGCCTTTCA 120 

TCTTTTTAAA AANTCATTAA TTTCC C TTCT TGTTTAANAC GTACATANAA GAAATACGCA 180 

TACGGTACTA ATAAAATAGT TGTATATGTT GCGTGTGTTA ATAATAATAC ACCGATTAAT 240 

TCAGGAATGA TGTTTAAGAA GTAATTTGGG TGTTTTGGTA ATTTTATATA ATCCAGATTT 300 

AATAATAGGA TGGTTAGGTA AAATGAATAA TTTTAATGTC CAAATACCAC CTAAAGTTTT 3 60 

AATAACCATA AATAACATGA TATAAGCAAA GATTAATATA ACTAAGCCAA TACCATTTGC 420 

AAAGCTAAAT GTATCTTTAT TAATAAATGC CTCTACACCA GCCAATACAT AAATTAAAAC 480 

GTGTGTTATT GCTAAAAACT TCGAATTTTT AACGCCATAT TCAACTGCAC CGTCTGCTTT 540 

TAATTGTTTT GAGTGANTAA TACATATCTT TAAGCTGACA AGTCTGATAC AGAGAAAGAT 600 

AAGTAATATA GATAGANTCA TGATGTCCTC CGTCATTATG TCATATGTAT AAGCGTTGAT 660 

TTTGACAACA TAAAGTATTT TATAGATAAA ACTTGTCACA TACTATTAAC TATTTATTAA 720 

TTTTAGTACA TAAATATGTT TCTAAGTATG TGTTTATGTT CAGTATTTTG GATAATTTAA 780 

TAATTTTAAG GATATTAAGC GCTTACAG 808 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 base pairs 

(B) TYPE: nucleic acid 
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(C> STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 





CGGAATTCTT 


ATTCATTTAT 


AAAATAATAT 


TCGTATGATT 


TGATGTTTTA 


ATTAGTCCAC 


60 




CATTTCGATT 


TGTGCTATGA 


TAATAGTGTT 


AAGTAAACGA 


AATAAGGGGT 


TATTAAGTTG 


120 




AATAACTACA 


AAATTGGCCA 


ACATATCAAG 


GTGCGTGTAA 


CTGGTATTCA 


NCCATACGGT 


180 


15 


GCGTTTGTTG 


AGACCCCTAA 


TCATACTGAA 


GGAC TGATTC 


ATATATCAGA 


AATTATGGAT 


240 




GACTACGTTC 


ATAATTTGAA 


GAAATTTCTA 


TCAGAAGGAC 


AAATTGTTAA 


AGCTAAAATT 


300 




NTGTCTATAG 


ATGATGAAGG 


AAAGC TTAAT 


C T A TC ATT AA 


NGGATAATGA 


TTACTTCAAA 


360 




AATTATGAGC 


GTAAGAAGGA 


AAAACAATCA 


GTATTAGATG 


AAATCAGAGG 


NACAGAAAAA 


420 


20 


TATGGG TNNC 


AAACACTTAA 


AAGAACGCTT 


ACCAATCTTG 


GATAAAACAG 


GCAAAGCGAG 


480 




CAACTCGAAA 


CNGACTAAAG 


GAACAAGATA 


AATCCGNACC 


CGAAAATCAA 


ACAAAGGGTC 


540 




TTGAAATGAA 


AGTTTCTTAG 


AC TATAAAAG 


AGATTAGGTA 


TCTATTAAAT 


TTTATTAGAT 


600 




ACTAATCTCT 


TTTTGTCTAC 


GATAACGTAA 


TATGATTGAT 


TCTATTTACA 


CGTACAAATG 


660 


25 


GTTTAAGGTG 


ACATATCCAT 


TATCTTTGTT 


AGATAGAATC 


GTTGATTTGC 


AATATTGTAT 


720 




GTGGATTTGT 


TTTTTTTATT 


TATTTTAGAA 


ATGAGAACTA 


CAACTTAAAG 


TATTAAACGA 


780 




ATTGCAACTA 


TATAAACAGA 


TAATTG 








806 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



45 


CTACGGCATA 


CCATGCGAAA 


TAATTGTTGA 


TTCTAATGCT 


ACAATCGGTT 


GATTGTTCTC 


60 




CCGTGCTTGC 


TGAACTTCTC 


GAGAATACTC 


AATATACTTG 


TGTAAATTTG 


CCATTTTTAT 


120 




AATCCTCCAT 


ATCGTGATAA 


AGTTGCTGTT 


GATCTAGGTT 


GTGCCTAACT 


GTATATTTCG 


180 




TTTCTATCGT 


TTTC TNTGCG 


TTAACCATAC 


CAGCAATTAA 


TATATCTACA 


GTAGACATCC 


240 


50 


CATTTANCCA 


GCTATATACT 


ACTGCAGCAC 


AGAATGAATC 


GCCTGCACCT 


GTAACATCTT 


300 




TCACACTATG 


TGGATGGCAT 


AACTGACTTG 


ATGATTCTTC 


CTCACCACTN 


CGGATAATTA 


360 




ATTCTTNCAC 


GC CATNTGTC 


ACAATAACAT 


TTNTACACCT 


AAATCATTCC 


AGCGGTTTTA 


420 




GCAGCTATTT 


NTANATCAAC 


AAGTAGATTC 


TTATTTNTAA 


NTTTAAGTAT 


GGTTCTGGCN 


480 


55 


CANCTTNANT 


CGGGGATAAT 


CCAATCAATA 


GCATGTAATG 


AATCAGGCAT 


ATTTTTCATT 


540 




TTTGGGGAAG 


AAACCGTGGT 


GATAACTAAT 


TTGATTTGAT 


GTTTCGTGGT 


ATAGGCACAT 


500 



36 



EP 0 841 394 A2 



10 



AAGAAGTTTA 
AATAAGTGTG 
GCTAAGCCAT 
GCATTTTCAA 
CATTTCCCAN 
AC CT AAGTTC 
ATTTGATGTT 
GCCAATACAA 



ATGCCTCTTT 
AACGCTTAAT 
ATGTCATGTC 
ATTGTTGAAC 
TCACTATCTT 
TCAGCAATAT 
TCTGCAACTA 
ACG 



GCCTAAATTC 
TAAAAATTCA 
GCCTTCTTTA 
ATGANCCCAA 
GTCCACTAGC 
TTCTTGCTAC 
AATCTTTATG 



AAATCGACAA 
GGCGTAATGT 
CTAATTAAAG 
ATTCATAAAT 
AGATAANAAA 
GCCACCAATA 
C AC ATAAAAC 



TAATGCACTT 
AGTCANACAC 
CTGTATATGA 
GGTGTGGNCA 
GCGACCGTTT 
GAGCGTGTTG 
TTACGATCTA 



AGC CTTTTTC 
TTCCATATCT 
ACCTGT AC TC 
ATCGTTGAAT 
CGCCAAGCCT 
ACGTTACAGG 
CATTCGCTGC 



660 
720 
780 
840 
900 
960 
1020 
1033 



(2) INFORMATION FOR SEQ ID NO: 8: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



35 



50 



55 



CTTACACAAC 

GTAAAAGAAT 

CGCTTTTATT 

AACCATTATG 

NACAAAGCGC 

TGTCAGAGAA 

NATATCAATG 

TTCATGTGTA 

AAAGTTGAAT 

TTAAAAGATG 

C TAT AGGAAG 

GGGTGATCCA 

TAATCTTTTT 

TCAAATTTAA 

AGTATGATCA 

GAACTGAAAA 

CTGANGTGAT 

GTTTACNAAA 

TNTTTCAACC 

AACAACAGNN 



AAGTTGTCAN 

TATTTGTAGC 

TAAATAATAT 

TGGNACACCC 

TATTTTAGCT 

TTTCACGCAA 

ACACACAATA 

AGATGCTTGG 

TGTTTCTGCT 

ATAGTGTGAT 

TGGCTTATTG 

GTGCCGAAGA 

TTAGACGAAC 

TTTAAAAAAA 

TTATCGCATA 

AGAAGTTATT 

TGAATATTGT 

TACAANGATT 

TCGGGTATNG 

CTTGGTTG AT 



AAATTATTTT 

TCATCGAGTT 

TTACTTTGGG 

GTGAATAAAA 

AGTAAAGTCA 

CGTGTTAAGC 

TCAACAGGCT 

AACTATAATG 

TAATAAATTT 

TTTTTGTTAG 

AATAATAAAA 

TTACTAAAAT 

AATTTGANAN 

GGGCAACAAC 

GGTTTAAACA 

CAATATTTAC 

TATCGCGAAA 

CGCACGACAG 

NACCACCTAT 

ATTCATCCAA 



TATGATAATG 

GAAAAACAAT 

GATAATCAAT 

ATAGNACAAC 

ATGCACCTAG 

ACGANCTTAG 

ATGTCACAAC 

TAGAAGGTTT 

TACAGATTTG 

CATCGAAAAA 

GAGTATTGTA 

AG AAGTTC AA 

GGNNANTGGG 

TTGAAGCTGC 

AAGCAATCCA 

AANAAGAAGA 

AGTTANTCGA 

ATAATGGACC 

CATTGANATG 

AN 



ATCGNNCATT 
ATAATAAGAA 
ATACGCTTGA 
AATGTCTCAC 
CGTATATAAT 
AAAAAATGAA 
TAAATC GCT A 
AATTGAAATG 
CTTGTAATCA 
TATTTAAAGT 
TAAAACTTGA 
AAGAAAAACA 
TATAGATATC 
TGACATGGNA 
NTATTTATCA 
GATATCAGAG 
CCATCAAGAT 
TANANTTNAT 
TTTACAGANC 



TACTAGAAAA 

C GAAATTTT A 

GGGCGCAGCA 

ATCACAGNTT 

NTCAATAATA 

GCAACAACAT 

AATACTNGCA 

TCATAATGAT 

CACTNTCATT 

TAAACATTTG 

GAAAGTAACG 

AAGAACGTCT 

GATACATTAG 

GAGGTTC AAN 

TATAAAAAGA 

CAAGCGATTT 

TATGCGGAAA 

CANCAAAAAC 

TTTCTTAGNG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1172 



(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



75 


GAAGGCTTGT 


CGTCCCATTT 


AAGCATCACT 


CCGTTTGTTT 


TCGCTTAAAA 


TCCCTTGCAT 


60 




CGATGC TAAC 


AATTGATCAA 


CATCTTTAAA 


TTCTTTATAG 


ACTGATGCAA 


ATCTAACATA 


120 




TGAAACTTGA 


TCAACATGCA 


TTAACAAGTT 


CATAACGTGT 


TCACCTATAT 


CTCGTGAAGA 


180 




CACTTCCGTA 


TGACCNTCAT 


CTCGTNATTG 


GCATNCAACC 


TTGNTTGNTA 


TGTCTTCAAG 


240 


20 


TTGTTGGATA 


TCTAACTGGT 


CGGTTCTCAC 


AAGAACGCAC 


AAGTCCAATA 


AGTATTTTNT 


300 




CTCTTGAAAA 


CTGNTCTCTT 


GCGCCATCTT 


TTNNCACAAC 


TATAAGCTGA 


CTTAACTTCG 


360 




ATATGTTCAA 


ATTGT AGGGG 


GAAACCGTGG 


TTCCCACAA 






399 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



40 


ATTTATATAT 


AGTAACAAAA 


CTAAATTATA 


AATGACAATA 


TCATATTTCG 


CAATCACTGT 


60 




TAGTAATTTA 


AATATTTTTC 


AAAAACGACA 


TAATCATCAG 


TTTGTTCGAT 


TTCAACGAAA 


120 




TTACTATTTA 


AACATAATGG 


TTTAAAATCA 


TTATTCTCTC 


CAAACAGAAT 


CATCGTAATC 


180 




ACATTAAGGT 


TGTCATAGTT 


TTTTCTTAAG 


TAAGTTGGCA 


ATTGTCGCAG 


CGCACTTGAA 


240 


45 


CC AATAC CAT 


TAAAGCGATA 


TGATTTATCC 


ACTTCAATAG 


CAGCAATGTT 


CATGCGTTCA 


300 




CCAATCTGCT 


CCAATGCGAT 


AAAGCCACCT 


NTTGGCANCA 


GCATCATAAA 


NTTAATGATA 


360 




TNTACATNGG 


CCACTTCTTG 


AACGAC TTTC 


TTCCTCCAAG 


CTTTTAATGA 


CGAACATTGT 


420 




TTNGATCTTA 


AGTCCAAGTA 


TTTGGCCGGT 


CCTTTGCATT 


GGTCCTCTTC 


TTAGTCGNTT 


480 


50 


GATTGAATGA 


TCCAGACGTC 




GATTGGNCAA 


TTTCCGANTC 


TACTACTGNN 


540 




TTTGNTGATA 


AATCAGGAAT 


CACCCTGACA 


TTTTGGGAAT 


GAATCAATNT 


TTCCNGGCCA 


600 




ATTCCAAACC 


TGGTTTTTTC 


CAATAATCGN 


TT 






632 



(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS : 
(A> LENGTH: 43 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



15 


GATATACAGC 


ACTCAGAATT 


TTAATGACAA 


TTAATGCAGC 


AGTTAACACG 


ACAACGCCAT 


60 




TAAATGCTTC 


TTTATGC TTC 


ATCACTGATT 


CTCATGCTTT 


CTTCAATGCA 


CTTAACTAAA 


120 




AACTTCAAAC 


TATCAAGCCA 


TTGATTTTGT 


TTCGTTAAAG 


TAATTGTCAt 


TGCATTATTT 


180 




TGAACACCAA 


CCTTCATTGT 


TCTACCTAAA 


GGTTGTGTTG 


CTTTGAACAG 


CACTTCGCCA 


240 


20 


TCAATATTTT 


CAGTGGCTTT 


TACAGATAAA 


TGAATATCAA 


TTATTTTCCC 


TTTATC TTT A 


300 




ATCAACGTGA 


TACCTGAATG 


TAATGCGTGT 


ACTTTTATTT 


CCACTATATC 


AAGCAAACGT 


360 




GCTACTTCAA 


CAGGNTAATC 


CAATTAAACG 


ATCAATTANT 


TCATCTTTAN 


TGTCGATAAT 


420 




TGGGNCAAAT 


TGTTCCAG 










438 



25 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: - 

30 (A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 



40 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 12: 






TACCAAAGTT 


C TTCTTT AG A 


GGCTGGGGAA 


TATNTCCNAA 


CTACATCACG 


ACGCACTCGA 


60 




AACGGTTCCT 


CGTTGGAAAA 


TCGATGTCAT 


TTCGTGTATA 


TTGCACATCT 


ACAAGTTTTC 


120 




TAAGTAATTC 


ACTTCTATCC 


ATTTCCATAC 


CAACTCGAAC 


ACTTACTACT 


AAATCTTTAT 


180 


45 


ATTCTTCAGG 


ATTACCTAAA 


C CATAT AT AC 


AACTTACACT 


AGCAATAATA 


ATTACATCAT 


240 




CGCGTTCAAA 


TAATGCACTT 


GTAGCAGAAT 


GTCGNAGNTG 


ATCAATTTCA 


TCATTGATTG 


300 




AGGCATCTTT 


TTCAATAAAA 


GTGTCAGTAG 


ACGGTACGTA 


TGCCTCAGGT 


TGATAATAAT 


360 




CATAGTAACT 


TACAAAGTAT 


TCCACCCTGT 


T T TC AGG AAA 


AAATTCTTTA 


AACTCACTAT 


420 


50 


ATAATTGCCC 


TGCTNATGTT 


GTGGTAGGTG 


CGATAATTAA 


CGTTGGTTTC 


CCAACTTCTT 


480 




TAATAACATT 


AC TC ATCCGN 


CAATGTTTTC 


CCTCGNGGCA 


GG 




522 



(2) INFORMATION FOR SEQ ID NO: 13: 

55 

(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 





ATCGCCACCT 


GTACCATGCA 


ACAATATTAG 


CGTTGGTGCA 


CCTTTNTTGT 


CCTTCACGAA 


60 


15 


AAATATGTTC 


CATCACAAAT 


CCTCCTCATT 


CCAATTAATC 


ATG TTAACGC 


TTAGTATTAA 


120 




ATGGTCTTAT 


TTCTGATTCT 


ATATACTCTC 


TTTTATTTTC 


TAAAAATGGT 


GGCAATGATA 


180 




AACCTNCACC 


TAATGTTTCA 


TACGGNTCAT 


CTTCCATAAA 


TCCTGGTCCA 


TCTGNTGAAA 


240 




TTTCTATTAA 


AATATGCCCC 


ACACGTGNAT 


ATAATGCTTC 


AAAATAGAAA 


CGATTAACGA 


300 


20 


TGCCTGAGTT 


GGTTAATACC 


TACCTCTTTA 


TATTNCGGGT 


GCCCACGCTT 


CTTATTTGCA 


360 




TCAGGATCTT 


TCCACAACGA 


AATGACACAT 


GATGT AC CTC 


ACCATAACTT 


GAGGGNGGTT 


420 




GNTGGCCTNN 


ATCATCTTTT 


ATTAAGATTA 


CTGGNCACAT 


TGGCGNCTTC 


GCAACTTCAA 


480 




GGTAATGCGA 


CATATCTTCA 


AGCGCAATAG 


GTGGCATACC 






520 



25 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

40 



GAATTCCATC 


AGTGCGACAA 


CTGAAGTATT 


TANACCAATA 


ATGTCATCGA 


CACTAGTTAC 


60 


TATTATCGTC 


TTCTTACCAC 


TTGTGTTTGT 


ATCAGGTTCA 


GTAGGCGAAA 


TGTTTAGACC 


120 


TTTTGCATTG 


GCTATTGCAT 


TTAGTTTATT 


AGCATCGTTA 


TTAGTGTCAA 


TT AC AC TCGT 


180 


NCCAGCTTTG 


GCAGCAACAC 


TATTTAAAAA 


AGGCGTTAAA 


CGTCGTAATA 


AACAACATCA 


240 


AGAAGGATTA 


GGTGTTGTTA 


GTACAACTTA 


TAAAAANGTA 


TNGCATTGGG 


TCACTAAATC 


300 


ATAAGTGGGA 


TNGGTAATTA 


TATTAAGTAC 


ATTAATTTGG 


GGNTGNAACT 


ATTGTTTTTG 


360 


GGGGGACCNA 


GACTAGNCAC 


TTAGCTTTTT 


TCCAGAA 






397 



50 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

10 



15 



20 



AACCATCTTA 


CCGATATCAG 


TACAGTACGG 


CGTAGGAACG 


GGCATCCAGC 


GGATTAGAGG 


60 


TGGTGAAACC 


AACC TGGTTG 


TTGATAACGA 


TACGTACCGT 


ACCGCCAACT 


TC AT AACC AC 


120 


GCGCTTTCGA 


CATGTTCAGG 


GTTTCCTGAA 


CCACGCCCTG 


CCCGGTCACT 


NTGGGGTCAC 


180 


CGNGGATGGT 


GATTGGNAGN 


ACTTTGTTGN 


TGCTCGGNTC 


ATCAAGTCTG 


TCCAGACGGG 


240 


CACGAACAGA 


ACCGATANCT 


ACCNGGCTTA 


CAATCTCAAN 


GTGAGACGGG 


TTAAACGCCA 


300 


GNGCCAGGTG 


NACCAGGGCG 


CCATCGGTCT 


NTAAGTCAGA 


CGAGAAGCCC 


ATGTGGTATT 


360 


TCACGTCACC 


CNTGCCGAGG 


TGTTCTNTAT 


GTTTACCCGG 


GGGAACTCCG 


TCGNAACAAN 


420 


TCTTTGGGGG 


TTTTTTACCC 


CAGC 








444 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 





GAATTCCGTG 


GAGGGCAATT 


TTGGGCTTGC 


AATAAACAAT 


CATCAAAATA 


TTGTTGGTAC 


60 




TATAGGTTTG 


ATTAGACTAG 


ATAATAATAT 


GTCTGCACTT 


AAAAAAATGT 


TTGTTGATAA 


120 


40 


AGGTTATAGG 


AATTTAAAAA 


TAGGTAAAAA 


ACTTTTAGAT 


AAAGTTATAA 


TGACATGTAA 


180 




AGAACAAAAT 


ATCGATGGNA 


TNTATTTAGG 


AACAATNGAT 


AAATNTATNA 


GTGCACAGTA 


240 




TTNTTATNCT 


AATAANGGCT 


TCCGCGNAAA 


TTAAGCGAGG 


AGATTTACCA 


AGNAGCTTTC 


300 




C T AAATC AG A 


TGTAGATAAT 


AGGTTCCATT 


ACAGCAAANT 


CNAAAGATTA 


ANTCGANGCA 


360 


45 


CTGAATAANT 


GNNATGACAN 


CCCTCAG 








387 



(2) INFORMATION FOR SEQ ID NO: 17: 

SO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



10 



GAATTC C TAG 


AAGGAGAATT 


TGTTAGTGGT 


GTTGCAGAAA 


AGCAATGGTT 


GGAAGAAAAA 


60 


GGTTATGAAG 


TAGACTTTGA 


ATTAATGGCT 


GAAATAAATC 


CAACAAGATT 


ATCATTAAGT 


120 


GATTGGCTTA 


AAGTCCAAAA 


CTATAATAAG 


TAATGAATGG 


GGTGCAATCA 


TGATTC AATC 


180 


AATGTGGTTT 


AATTTGCATG 


TGCAAGATTT 


AGAAAAGAGC 


GCACAGTTTT 


ATAAAGCGTT 


240 


AGGATTTAAA 


ATAAATAGAA 


ACCCACAAAT 


GTTAGATAAA 


ATGGTCGGTA 


TTCAAATAGG 


300 


TCAAACAACC 


GCAATTTTAA 


TAGAAAACAA 


GCATTTTCAA 


AATGTAAGTC 


AGCAAAGCCT 


360 


TAATACTGAA 


CCAAATGAAG 


TGATGATTTC 


TCTAGGTGTG 


ACACAATGAG 


AGTTGAC 


417 



(2) INFORMATION FOR SEQ ID NO:18: 



2Q (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

GAATTCCTAT ATACTACTTT GAAAAGGTGT GAGCTTAATG ACAACTTTTA GTGAAAAAGA 60 

AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT GAAAATAATA ATGAAATAGA 120 

35 CGTTTGTAAT TATTTAAAAG ATTTATTCGA CAAGTACGAT ATTAAATCTG AAATTTTGAA 180 

AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT AACGGNTCAC CTATACTCGC 240 

ATTGAGTGGT CATATGGATG TTGTTGGATG CAGGAAATCA AGATAATTGG ACTTATCCCC 3 00 

CTTTNCAACT CACAGAAAAA GCTGGCAAAT TATACGGTCC G 3 41 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 351 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GAATTCCTCA GGTATAACAC TTCNCATATC TTTACGAGNA AC TTTTC AAA CTAGATACCC 60 
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CTGCACCAAC CAACGTCCAT AATGGTTGAT AGTAATGATG GTCTGCTGGG TCTATAATTG 120 

CTATTTTCTC TTTTAAGTTN TGATTTTTTC TTAATAGTCT TGATGCTACG GTAACACCTG 18 0 

CTGTACCGCC ACCAATAATA ACAATNTGAT AATGCTTATT CATTTNTTCC ACATCCTTAA 24 0 

ATTATTTTAA TGATTCTGGA AAATCTTGAT AGCCTTCTCT AATATTCACC ACATTTTCAA 300 

AACCTNTGCT TTCCAATATA CCCACTGCAA TTGAAC TTCT AACACCTGAC T 351 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 324 base pairs 

(B) TYPS: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



25 



30 



GAATTCCGTG 


TTAAGAAACA 


AGATTAAAAG 


AGCAATACGT 


GAAAATTTCA 


AAGTACATAA 


60 


GTCGNATATA 


TTGGNCAAAG 


ATATTATTGN 


TATAGNAAGA 


CAGGCAGCTA 


AAGATATGAC 


120 


GACTTTACAA 


ATACAGNATA 


GTCTTGAGCA 


CGTNCTTAAA 


ATTGGCAAAG 


TTTTTAATAA 


180 


AAAGATTAAG 


TAAGGATAGG 


GTAGGGAATG 


AAAACATTAA 


CCCCTCAACG 


CATCCCGAAG 


240 


TCTTACCTCA 


GACAAACGTT 


AGACTGACCC 


TAGGGTTAAG 


ACTTGGCCCN 


AGGGNTATNN 


300 


TAAC TTACTT 


TAAAATGTTT 


TCAC 








324 



(2) INFORMATION FOR SEQ ID NO: 21: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 





GAATTCCTAT 


TATCGAATTG 


CATGCTACCC 


CTCCTAAAAN 


CCAAATGCTC 


TTAATTTATC 


60 


50 


TACAGATAAT 


TGGTCTTTAT 


CTTTATTTAA 


AATATTTTCA 


ACATATTTAA 


ACAAAACGTC 


120 




TGTTTCTAAA 


TGTAC TTTAT 


CTCCTAATTT 


TGTGGATGAT 


AAAATCGTTG 


AACGCCTCGT 


180 




TTCTGGAATA 


AGATGAATGT 


CAAAACTGTT 


ATCATGCTTA 


TCAAATACCG 


CTAGACTTAC 


240 




ACCATCCACA 


GTAATAGACC 


TTGCTTAACT 


AAC TG ATT AT 


TAATATGTTG 


GCTACATTGA 


300 


55 


ATCGNAATAA 


TTGTGGCATT 


GGTGTTTCAT 


TTATTTGTGA 


ANTGTCCTAG 


TCACTACATG 


360 




ACCGAGGAAA 


AATGTCCACC 


AAACC T ACCG 


NTACACTCAT 


GGAAGNTCTA 


AATTACTTCT 


420 
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10 



20 



GATTGGCGCT TACANCTGCT AATAGGTTTA TTTCAAGGCT TTATTACTTG ACAGGAAAGA 480 
TGCT 484 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 





GAATTCCTTA 


CTG TATGATA 


AAGGTATTAC 


CAAACAATAC 


TTAAGGGGGA 


TTATATACTG 


60 




TGGNTCAATC 


ATTATATGAG 


T TTTT AG AGG 


AAAATATAAA 


TTATCTAAAA 


GAAAATGGTN 


120 




TGTATAATGA 


AATAGATACA 


ATTGAAGGTG 


CAAACGGACC 


AGAAATCAAA 


ATCAATGGGA 


180 


25 


AATCATACAT 


TAACTTATCT 


TCANATAATT 


ATTTAGGACT 


AGCAACAAAT 


GAAGATNTGA 


240 




GATCAGCTGC 


AAAAGCAGCT 


ATTGATACAC 


ATGGTGTAGG 


TGCAGGCGCT 


GNTCGCACAA 


300 




NCAATGGTAC 


ATTAGATTTA 


CACGACGAAT 


TAGAAGAAAC 


ACTAGCAAAA 


TTTANNGGAA 


360 




CAGAAGCTGC 


AATAGCTTAT 


CAAGNCAGGA 


TTTANTTGGN 


NATATGGCTG 


CTATNTCAGC 


420 


30 


TTGTCAANGA ATNAAAATGA 


TGCT ATTTTN 


TCAGATGAGC 


TTAATCATGC 


ATCAATTATT 


480 




G 












481 



45 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 





GAATTCCGTG 


TTAGGTGGTA 


ACGTTACAGA 


AACACATACC 


TTTAAATTGC 


CAGAAGATGC 


60 


50 


TGGAGAGCGC 


CAGATGTTCA 


TTATTGATAA 


AAAAAGACAG 


ACGCCGAAAA 


AGTACCCAAG 


120 




AAAACCAGGG 


ACGCCTAATA 


AGACTCCTTT 


ACTTGAAAAA 


TAATGCATAA 


TCCTTTACAA 


180 




CTAACATAAA 


AGGAGCGAAT 


GGATAATGAA 


AAAACCTTTT 


TCAAAATTAT 


TTGGTTTGAA 


240 




AAACAAAGAT 


GACATCATTG 


GACATATTGA 


AGAAGGATCG 


CAATAGTAAT 


G TTGAATCC A 


300 


55 


TTCAAATTGA 


ACGTATCGTT 


CCCAACCCGT 


TATCAACCCA 


AGACAGGTGT 


TTGAACCCAA 


360 




ATAAAATTAA 


AGAACTTGGT 


GAATCAAATA 


CAATGAACAT 


GGGTTTACTA 


CAAACCTATT 


420 
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10 



15 



25 



40 



45 



50 



55 



GTTGGTAAGA CCGATGAGAA GATATGTTTG AAATTATTG 459 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GAATTCCCNA AAGATACNAT NCNATTTAAA TGGATATCAT TCCAAGAAGC TGTAGATAAG 60 

AATCGGGAAC NAATGTCTCT GTTTAATGAA ATTCTTAAGA GTCATCAAAA GACTCCANTT 120 

TTTGATACAT CTAGCTTGCG ACATACTTCG AAAAAATTAT CAACAATTTT AAAAGGGGAT 180 

TTGTAAAATG AATATTGAAA AAATCGAACA AATAATCAAA TTAGTGAAGG AAAATGATGT 240 

TAAGAAATTT AAATATAAAA ATTTTGANGA TGAAATAGAA ATTGAC TTC A CTGACTCGAA 3 00 

TCATTTGGCT GCACACAGGT AATCAATCAA NTCAANGTAT GAACAATAAT GATTTGACAG 3 60 

CTTCAANAGC GAATGATAAC TCCGATGTNT CGACAAATTG ATTATCATGA CATTAANTCA 420 

CCANTGGNAG GTACATTCTT 440 

(2) INFORMATION FOR SEQ ID NO: 25: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CCCCCGGGCT GCCACGGTAA CTGATGCCTC CGCGGTACTG GCGACAGATT AC AAAATC TC 60 

GTTCGATAAT AATCAGTGGC AGGTCACCCG CCTTGCCAGC AATACCACTT TTACGGTGAC 120 

GCCGGATGCC AACGGTAAAG TGGCATTTGA TGGTCTGGAG TTGACGTTTA CAGGAACGCC 180 

TGCCGTTAAC GACAGCTTCA CGCTGAAACC AGTAAGTGAC GCCATCGTCA ACATGGATGT 240 

ATT AATC AC C GACGAAGCGA AAATAGCGAT GGCGAGCGAA GAAGATGCGG GTGATAGCGA 300 

TAACCGCAAC GGTCAGGCCC TGCTGGATCT TGCAAAGCAA CAGTAAAACG GNNGGCGG 3 58 

(2) INFORMATION FOR SEQ ID NO:26: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 480 base pairs 
s (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



15 



20 



GAATTCCTAC 


CAGACGTCGC 


AATGGTCTAT 


AATTTCGCTC 


TAGATAATCT 


TGGAATAATA 


60 


TCTTTTATTG 


TTAAATATAC 


TTCTGGTTGT 


CCAAACATTG 


CTGTTGAAAT 


AGCAAAGTAA 


120 


CTTTCTAAGC 


GCCCCATTCT 


CGTTATTTTA 


TTAATAGCGA 


TACCTACATA 


TTTGATAATA 


180 


AATGGTAATA 


CCTTAATATA 


ATTAAAAATG 


CCTATTAATA 


CAGAAATGAA 


CACTAATGGC 


240 


AGTAACACAT 


TTAAAAAGAA 


CGTAAAGCCA 


TTTNTATTTT 


GTATATCTCC 


AAAAACAAAG- 


300 


TTTATACCTG 


CTTTAC TAAC 


ATTTATTAAC 


CCTTCCAAAA 


AATGAGCCTA 


ATGCAGGTAA 


360 


GATTGGTTAA 


ACCAATCGTG 


GTGTTCATCC 


ATTAAATTAC 


AACAATCCAA 


CAACTTTGGA 


420 


TTAAATTATC 


AATTATTAAC 


NGCCNTTTTT 


GGAAGTC TAT 


GTTTTTACGG 


ATCCCAAAAA 


480 



25 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

40 



SO 



GAATTCCTCT 


TTCACTTCTA 


TGATTTTTTT 


TAAC TTCGAA 


ACGTCAAGTT 


GTGCATCAGG 


60 


TTCTGCTGTT 


ACTTCCATTA 


CATAACCTTG 


AATCGTTCTT 


GGTCCAAAAG 


GTACAATTAC 


120 


ACGCACACCA 


GGTTGGATGA 


CAGATTCGAG 


TTGTTCGGGA 


ATTATATAAT 


CAAATTTATA 


180 


GTCAACGCTC 


TTCGACGCGA 


CATCGACTAT 


GACTTTCGCT 


ATCATTATTG 


CCACCTAGTT 


240 


TCTAGTTCAT 


CTAAAATTTG 


TGCAGCTAAT 


ACT AC TTTTT 


TTCCTTTCTT 


GATATTTACC 


300 


TTTTCATTAT 


TTCTAAAATG 


CATTGTCAAT 


TCATTATCAT 


CAGAACTAAA 


TCCGATAGAC 


360 


ATATCCCCAA 


CATTATTTGA 


AATAATCACA 


TCTGCATTTN 


TCTTGCGTAA 


TTTTTGTTGT 


420 


GCATAATTTT 


CAATATCTTC 


AGCCTCTGCT 


GAAAGGCTAT 


TAATACTGNG 


ATGTTTATGT 


480 


CACCTAAATA 


TTAAGAATGT 


CTTCAGACGT 


TTAAAAGATA 


CTGACAATCA 


CCACCTGGTT 


540 


TTTCA 












545 



(2) INFORMATION FOR SEQ ID NO: 28: 



46 



EP 0 841 394 A2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



1Q (ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



15 


GAATTCCTNA 


C CATC CAT AG 


C TGTGNAC AC 


AACATGATCT 


TTATCAATCA 


TCATAATGTT 


60 


GCAAAGGGAT 


TTCTTTNATC 


AAACTCAGGT 


GCATATGCTC 


GNACTTCTTC 


CATAGTTANG 


120 




GGCAATTTCG 


TTGGGGTTAA 


AGGG TGTGTT 


GGATGCCCCT 


CTGGNNCCAA 


GCTCTCAGAA 


180 




TAAGTTAACT 


CATCTGTCTT 


ATCACTATCT 


TTTACATGTT 


GTANCCATGA 


AAAAAAGTTA 


240 


20 


ATATCTTGTG 


GTAATCTNGA 


AAATTTAAAA 


CTGTGACTTA 


TTAATGTGTG 


TCTGTGAGAC 


300 


ATGTGNTTAT 


ATGTCACATC 


TAAACTATCT 


CGACTGTGTA 


TTANCTCCTT 


CGNNG TAGTC 


360 




GCTCTGANNA NTAGGGATAT 


CAANGGTCAT 


GACTCAACAC 


ATCTAACAAC 


NCCTTCTTAA 


420 




NGGTTATTA 












429 



25 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 
3Q (A) LENGTH: 43 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

40 



45 



50 



GAATTCCTAA 


TGCANAAAAN 


TTCCCCGAGT 


CACAGTACAA 


TTTGCCGTGC 


TTCGGNGGAT 


60 


TACTTTGTTT 


ATTTTTATTG 


TGTANCATTA 


ATAATGAGTT 


GTCTTAACGA 


TTGTCGC TTT 


120 


ATAACTACAC 


GTGCTGTGAC 


ATCTTTTAAG 


AAAAACACAG 


AAGGCTTGTG 


CATGTGATTG 


180 


TAATTGGATG 


NCATAGAGTA 


ATGATATGCA 


CCAGNTGATA 


ATATAGCAAG 


ATAGTCTCCA 


240 


CGTGTGACTG 


ATGAAGG TAA 


TTTAGCGTCT 


TTTAATAATG 


ATATCACCAG 


ACTCACATAA 


300 


TTTTCCAGCT 


ATAGNTACAC 


TGACATCTGC 


TTCTACATCT 


CTTATTTAAC 


AAAGCAATGC 


360 


TTGATACTGT 


GCGGTCATAA 


AGGTGAAGGT 


TCTTGATATT 


GAATCACTCA 


TNCCGCATCG 


420 


AGTTGAACAT 


ATTT 










434 



(2) INFORMATION FOR SEQ ID NO: 30: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 5 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



15 



20 



GAATTCCTAC 


AAGGATTCAA 


AATAAAATAC 


AAGTGTTTTG 


CGAATATTTG 


TTCGTATAAA 


60 


ATCTTGAAAT 


CAAAACGCAA 


AAACGAACAA 


ACGTGCTTAT 


ATTTTTATCG 


ATAATACAAT 


120 


TANTTTGTTG 


TTCTAAGCCA 


ATTTAATAAT 


CTTATGAGCA 


CATNATTTTG 


ACTTCTGTTC 


180 


TTTAATAAAT 


TTCTAGATTT 


AGACATTTTA 


AATTGCTCAG 


TTTTAAATTC 


AAAGTTATTA 


240 


TCATAGCCTA 


TATGGACTAC 


ACCATCATAC 


TCTAATAGAA 


TAATTGCTGA 


TGACACATTA 


300 


AACAATTTAT 


TAACAAATTG 


CACTGCATCT 


TTTAATTGCT 


GCTCAATAGG 


TTTATTAATA 


360 


TCAACAAAGT 


TTTCATTGGT 


GATTAATCAT 


ACCCTTTAGA 


ACATCGGNTT 


AAATCCNCTT 


420 


CTTNCAATCG 


ATGATATTAA 


GCACCATTAG 


TAATACCATC 


ATTTTTACAA 


AAGGGTCANG 


480 


AATTTTCTTN 


ATTACAGCTT 


GGTCAATTAT 


GTGTCANCTT 


GAACCATARR 


DCTNS 


535 



(2) INFORMATION FOR SEQ ID NO; 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 



40 


GAATTCCGTC 


ATTTCCAGGA 


GGATANGAAA 


TAAGAATTGA 


GTTATTAATC 


AGTGGTTTTA 


60 




TTTTTATATT 


GGCTTCCCTT 


AGCGATTGTG 


GTTGATGGTT 


ATTTAGC TAG 


AAAATGGAAT 


120 




TTAGTTACAA 


ATATGGGGAA 


ATTTGTGGGN 


TCCATTAGCG 


GATAAATTAT 


TAGNTGCAAG 


180 




TGC TTTAATT 


GTNCTTGTGC 


AACTAGGACT 


AACAAATTCT 


GTAGTAGCAA 


TCATTATTAT 


240 


45 


TGNCAGAGAA 


TNTGCCGTAA 


CTGGTTTACG 


TTTACTACAA 


ATTGAACAAG 


GATNTGGTAA 


300 




GTGCAGCTGG 


NCAATTAGGT 


AAAATTAACA 


CAGCAGTTAC 


TATGGGTAGC 


AAATTACTTG 


360 




GGTGGGTATT 


AGGGTG 










376 



so (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 518 base pairs 
55 ( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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35 



40 



45 



50 



(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GAATTCTCAG TTCATGTTGT GGNGACACTT TAAACGGNCT GTGC C AG TAG CGACCGAGTC 60 

ATTTCAAGAA TGACCATTTC ACATTTATAT TATAACACTT GTCGNGCGTA ACTGTATAGN 12 0 

TTTTCAGNNG TGTTTAAAGT TAAGTTATCT ACTTCGCGCT TTCCTTGGCT TAATTGTGAA * 180 

ATTACATATT GCGCTACGCC AGNTTGGTTG GGAATTTGGT AACCTGTTAT ATCACTTNTG 240 

GTCAANNCAA TTATTTTTAA TTTATAATCA CTCATATTAT CTACGTCCAC TCNTTTTATC 300 

TAAACAATAA AAATGTGTTT NTCTCCCCNT AAATAATAAC AATGGCAGGC TTAATAAAAA 360 

CAATATTAAA TACAATTTGC TCTTGTCAAA ATTGAAAACC CCCACATAAT ATTATTTTAT 42 0 

TTAAGNGTAA GAGGGGGGCA ATCAGGGCTC CAGCATAATG TTAATCTTTG TCGGGTGGGG 480 

TTCCNGTCTT GGGTNGCCCN AGATGCCCAC TCTCTAGG 51 B 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

GAATTCCACT AGAAGTTNCA ACATTACGCA TTTTACATTT CGTAAGTGGA GACGGTGGCA 60 

AGCTAAAGAT TCATTATGAA TTATATCAAG ATAATGAAAA AATGGGTTCT TATCAATATG 120 

AAATTAACTA TNAGGAGATA GGCGAATGAA TATTATTGAT CAAGTGAAAC AAACATNAGT 180 

AGAAGAAATT GNAGCAAGTA TTAACANAGC AGGATTAGNA GNTGAGATTC CTGATTTTAA 240 

AANTGAAGTT CCNNNNGNNT NCAAAAAATG GAGATTATGC TACTAATATT GCGATGGTAC 300 

TGACTAAGAT TGGAAAGNNG TNCTCCNCNC TGNNNATTGC TCANGCGGAA NGNTTGNNNA 360 

CCTTAGCTCC NGGACACCNG CACATGNAAC ACAACTNGAC ATTGCCTGGC CCCACGGCTT 420 

CAA 423 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



10 



15 



20 



25 



AATGACAAAA 


TATTAATCAA 


TTGGCGTATT 


ATACAAGAGT 


AATAGACATA 


TAAAAGTCGA 


60 


ATGTAACTAC 


GTGAGTACTG 


ATTTTATTCT 


TTGTAAATTA 


CAAGCATTTC 


AT ATC AT AAA 


120 


GTTTGAAAAG 


AGGTATATTG 


AAGTGGAGAA 


AAACGAATAT 


ACAGCTAAAT 


ATAATGAATA 


180 


TAGTCAATTA 


TTAGACGC T A 


CTTACTCGCA 


AGCTGTAGCA 


TACCTTTTAA 


ATAAATATGG 


240 


CGCTGTAACC 


GATGATTATT 


ATAAGGAAAA 


ATCATACACG 


CGATTTTTAA 


ATGG AG AAA T 


300 


CAAAAGTATT 


TCAAAAGGAA 


AATACACTAG 


AGCTAGCGAA 


GGATTATATT 


GTCATCACAT 


360 


AAGCGAAGAC 


AAATTCCAAA 


ACTTATCTTG 


ACCTAAGATT 


CATTTCCGAA 


TTTAAGTACT 


420 


CATACAACGT 


CAAAAGAAAG 


AAAACTTAGT 


GTACTGTGAT 


CTTATCGAGC 


ATTTANTTTT 


480 


ACATGCAATT 


ATTACAAAAG 


AATCCAATGG 


CAATTGGGGT 


GTGGGCTGGA 


TTATGTCAAA 


540 


TGATCAAACC 


AACAGTCATT 


GATTGGGACA 


TTAGCGAATA 


TACTCCAAAA 


CCAGCATGGA 


600 


TGCAAGCCAC 


CAAAGCACGT 


GCCTATGTGC 


CTAGAATATT 


AGTAGAGAAA 


CTACTCATTA 


660 


NNATTGACGA 


TATGTTAGAA 


GGAATAGAAA 


TATATGATNT 


CCNTGAGTCT 


AGATAAATGA 


720 


TTAANGGAGA 


TATATTTCGT 


GNTCGNTGAG 


ATTGTGNTAT 


AGATAGNNTA 


ACCCNNTAAT 


780 


TAAATATTAT 


TCTATNAAAT 


AGACCACGCA 


TNCC TATCTA 


TAAACGGACA 


ACGTTTATAA 


840 


ATGAGTTTGC 


ATGGTCTTGA 


ATTGTATTAA 


ATNTCTTNTT 


GGGTTTTAAT 


AGNTCACTTA 


900 


GNGGGTATGT 


CAACATTCTT 


TNGACAGCTT 


CACGTGTATT 


TTT 




943 



(2) INFORMATION FOR SEQ ID NO: 35: 

30 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1119 base pairs 
<B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



45 



SO 



55 



CAGATATACC 


TGCACTAATC 


AAAGTTTCAC 


CAAAATTTAA 


AGCTGTGTGA 


TAT ATC AC AA 


60 


ATCCACAAAA 


TCCTAAAATA 


AAAATAACAG 


GGATATCTCT 


TAGTTCAGGG 


GTAGGCAATT 


120 


TCTTTATAAT 


TACGAACGGC 


AAGAGAATTA 


TTGTTGCTAA 


AATTAAACGA 


AATGCCGACA 


180 


ATGATTCTGC 


ACTAAAATCA 


TTTAACGCAA 


TCTTTATCAT 


TGGAAATGCA 


GATCCCCACA 


240 


ATATGATAGT 


AAATAAATAT 


GATAGAAAAG 


TAGTGTCTCG 


AAGTTTATTC 


ATTAATATCA 


300 


TCACTCCTTT 


AATTATGTGT 


TTCTATATTA 


AAAAATATGA 


TTTAAAATGA 


GTACAACCAA 


360 


TTGGGAATGG 


ATTTACCTAT 


CCAATTTTAA 


AAGGGAGGGA 


GAAGATGGCT 


AAATATTAAG 


420 


ATATTGCTAG 


TGACATAAGA 


GATAAAATAA 


TCACAGGGGG 


ATTGGGTTTT 


ATGGAATGAA 


480 


GATACCTCCA 


CAGAGGCAGG 


TGGCGATACA 


AATACAACGT 


NAATAGAGTG 


ACGATTATTA 


540 


AAAGTATTGG 


AGTTATTAGA 


GGCTGAAGGA 


TTT AC TAT AC 


TAAAGTGGGG 


AGTGGGNACA 


600 
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10 



TATGTTAATG 


ACTATTTGAA 


TGAAGCACAT 


ATTACAAATA 


AGTGGTCTGA 


AATGATGTTA 


660 


TGGTCCTCTC 


AACAAAGAAG 


TCAGTATACG 


GTGCAATTAA 


TTAATAAAAT 


TGAGACAGAT 


720 


GATTCGTATA 


TACATATAAG 


TAAAGGTGAA 


TTGGGTATAT 


CGTTAATGCC 


ACATATTCAA 


780 


TTGAAAAAAG 


CCATGTCTAA 


TACAGCCAGT 


CATATTGAAG 


ACTTATCTTG 


TGGGTTATAA 


840 


TAATGGCTAT 


GGTTATATCA 


AGGTTAAGAG 


ATATTATCCG 


GNTGAACGAA 


TTGTCAAAGC 


900 


AAGGTATTAA 


ATTGTAGGTA 


GAGGAAAATT 


GTAATGGATC 


AACTCCAGGC 


GCTTTTACAA 


960 


TGCCANACCA 


ACTTNTTATC 


TTATTGGGGT 


TTNTAGGGTC 


AAAGATGCCA 


TAATAATTTC 


1020 


GGATACAACC 


ATCATATTTT 


CCACTCTACA 


AANTGTTTGN 


GGAACAAATG 


GAATTTTAGA 


1080 


CATATTGATG 


TCCCCNTATT 


ATCAAATTTA 


ATGAAATTA 






1119 



(2) INFORMATION FOR SEQ ID NO: 36: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



30 



35 



40 



AAGAAATGTT 


CTAGAGATTT 


AGGTATCGAA 


GAACCCAGGA 


TTAAGACGAT 


TAAAAGAGCN 


60 


ACTATGATAT 


TAGGATTATC 


AACATATTTA 


CTGCTGTGTG 


CAAGAAGTAC 


GTGC TTGG AC 


120 


AATTTAAACC 


AGGTATGACT 


GCACCTCAAT 


GTGCTGGTAT 


CATTCATACT 


GATTTTGANC 


180 


GTGGATTTAT 


CCGTGCCGGA 


AGTAACAAGT 


TATTGATGAC 


TATGTACAAT 


ATGGCGGCGA 


240 


AAGTGGCGCT 


AAAGAAGCGG 


GCAGACACCG 


ATTAGAAGGT 


AAAGAATATA 


TTATGCAAGA 


300 


TGGCGATATC 


GTTCATTTCA 


GATTTAATGT 


ATAAACGATA 


GAG TG AAGTT 


AATTAAATAG 


360 


TATATATGTA 


GAAGAGGCGG 


AATCAATTGT 


TCGCC TCTTT 


TAATTATGCG 


TATAATTTAT 


420 


TAAAAGAATG 


GAATGATTTT 


ACTCGCGTAA 


ATAATATCTT 


GAGTGCTGAA 


AAATTGTTTG 


480 


CCTTCGCCAG 


TATAAGCAGG 


CTCTAAAACA 


AGATTAGCCT 


TTGCACAATA 


AAGCCATTCA 


540 


GGATGAATGC 


CACTATTAAG 


TATCTCTTGG 


AATTCGATAT 


CAAGCTTATC 


GATACCGTCG 


600 



(2) INFORMATION FOR SEQ ID NO: 37: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
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20 



GAATTCGNAA ATGTTGGCTT TGGCGCAATG AAGAAGCAAC CACCNTCTAA AAGCTAAGTT 60 

AANCTTATTT GAACACATGA TNNCGANTGC ACGTACATCA NCAGAACAAG GAATGCTTGT 120 

NTCTGCGGNA AC AATGC TC A NNCANAGTAC GCTACCCACA TTTACGAAAA ATACATCACG 180 

AAGTCGTTCA TGATATGAAA TGTAGCAGAC ACGAGATTCA CCCTNTGTNT CCAGCTGACT 240 

TCGCAAGTCA NNNAAATGTG TTAACTCTAG CGGAAATGAA AAGGACAATT CATGATATAT 300 

TGGATTTCAG AGATGAAGAT ATTTGGGNTT GTTATTTGGG T AC TTTGGCT GTGTCTCCAN 360 

GCTTAANGGA TGATGANGGT CANNNGTTAC TATCACGTNT ANGAAATGCT TACAATNNTT 420 

ACGANTAGAA ATGACCCGGA TGGCGNGGNC NGNTAAATTG TCAATGNTTT NAAAGGTAA 479 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



30 


GCGAGCTTAG 


TTCATTTTAA 


AGT AAAGAGT 


TTTTTGTATG 


TAAAGTATAT 


TAGTAAAACA 


60 




CAGATTACTT 


GTCTCTGTGA 


AATTAATACG 


TTTATACTAA 


AAGATATAGT 


CTTTCTAGAA 


120 




AATCTGTTAA 


TTAATTGTGA 


ATTTTTAGAA 


AATTTATTGA 


ACAGCAAAAT 


ATGGATTGTT 


180 




ATAATTTAAG 


TTAAACAAAT 


TCTTATACAA 


TATTATTAGG 


AGGCAATCAC 


CATGTCACAA 


240 


35 


GCAGTTAAAG 


TTGAACGACG 


AGAAACATTA 


AAACAAAAAC 


CAAATACATC 


TCAACTAGGT 


300 




TGTGGGTAAA 


TATTTTACTG 


ATTATATGTT 


GAGTTATGAT 


TATGATGCAG 


ATAAAGGATG 


360 




GCATGATTTG 


AAGATAGTAC 


CTTATGGGCC 


TATTGAAATT 


TCACCTGCTG 


CACAAGGTGT 


420 




TCAATTATGG 


TCAATCGGTA 


TTCGAAGGAT 


TAAAAGCATA 


TAAAAGAGAT 


GGGGAAGGTT 


480 


40 


GCACTTTNCC 


CCNNCCCGAA 


GAAAATTTTA 


AGC GTCTTAA 


T AAC TCGTT A 


GCACGATTAG 


540 




AAATGCCTCA 


AGTAGACGAT 


GCAGAATTGT 


TAGAGGGGCT 


AAAACANTTA 


GTTGATATAG 


600 




AAAGAGATNG 


GNTTCCTGAA 


GGGGAAGGTC 


AATCATTATA 


TNTTCGNCCA 


TTTGGTCTNG 


660 




CAACAGAAGG 


GGCACTNGGC 


GTTGGCGCAT 


CACATCAGTA 


TNAAATTTAT 


TAAATTATNT 


720 


45 


TATCTTCCTT 


CAG 










733 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



10 



15 



20 



25 



30 



ACTTGTGCAC 


TGCCAATAAT 


AATTAATCCT 


TTCATATTTG 


ACCCCTCTTT 


TTTTTATTAA 


60 


AC TTG TTGTT 


CACGTAATAA 


TTTCATAATA 


GCTTTTCCAA 


CACCACTATT 


TTCATTCGTA 


120 


TC TGTGAC AT 


ATTTCGCTAT 


TTTTTTAACT 


TCTTCTGCAC 


CATTTTCCAT 


CGCAACTGGA 


180 


TAGCCAACTT 


TC TCTAAC AT 


TGATAAGTCA 


TTTAAATTGT 


CACCTATTGG 


CATGACATCT 


240 


TTCATTTCAA 


TC C CTAATCT 


TTCGGCAATT 


GTTTCTAGCG 


CAATACCTTT 


NGGTGCATCT 


300 


GAATGCGTTA 


TTTCTATATT 


TCCTCTCGAA 


GATGATGATA 


TAGCTAAATT 


CGGAGATTCA 


360 


GCTAAAATTT 


TACTAGCTTN 


GGCAATTTTT 


TCTTAAATTT 


CCATCAAATG 


CTAATATTTT 


420 


CATAATTAAT 


TCAACAGGTA 


TGTTTTCAAT 


AGGATCATAA 


TTATCAACAA 


CTTTCAACGG 


480 


TNCCAATATC 


TATGCG TC TT 


TGNATACCAT 


CTTNAAT AC G 


CTCAACGTNT 


GGATGTTGAC 


540 


CTGCACGCTC 


AGCAATATCT 


ATGTAAATGT 


CTAAATCTCT 


TGGGGGATCT 


TCCAGTATAA 


600 


ATCGCACGAC 


TCGGNTATNC 


TGGATAATAA 


GTACCTGNAT 


CTTTTAAAAC 


ATNTGTAATG 


660 


NGGTGTACTA 


ACGATTTATT 


ANGGNGTGAA 


GTGCTCATTA 


CATTGAAAGT 


TTCATCACGT 


720 


TCTTCAGNAC 


CATTCAGACA 


AATATATGGT 


ACTGNTAAAT 


CTGTGTCAGC 


AANTGGTGCT 


780 


TGTGCTTCAA 


AAAATGCTCG 


ACCTGGCACG 


AGCGATAACA 


ACCGTTATCC 


CTTGTTCTTG 


840 


AGCGTATTNA 


NTCGCATCAA 


TATTAGGTTG 


AGAAATTTCA 


TNTGCTGCAT 


TAAGGTAGCG 


900 


NGCCATCCAT 


ATCAGNGGNT 


ATTAGG TTTN 


TCANTATGTT 


NCCNCGCCCN 


CGNC TTTCGT 


960 


AAATTTAAAA 


TCTTGTTCTT 


AAATAAGAAT 


AT AT AC TC AG 


CGCACATACT 


TTTCTATTAA 


1020 


CATTTATATT 


GTCATTAATT 


TATCATATAA 


TGTAATTCTA 


ACAAATTTTA 


ATTAGTATGT 


1080 


ACTATCGTCT 


TAATTGGTGG 


ATTTCTTATT 


GGCTCTTAAG 


TTTTTAAAAA 


ATG TTGGTT A 


1140 


ATAATGGTGC 


TACATGCTTC 


TTTTAAGGTA 


CAACCTTTAT 


CAACAAATTG 


GACGGATGAT 


1200 


TAAAAATTAG 


GTTNGGTTGG 


AAATAAATTC 


AATTAAACTT 


GCTACTACAA 


ACCACCRRDC 


1260 


TNS 












1263 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 base pairs 

40 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



55 



ATTATGCCTA 


TTTGACACAT 


TATTGAAATT 


TTTAATATAC 


CCCTTATACG 


TTACGGCTTT 


60 


GTACTGTGTT 


TAGGTATATA 


AAGCTTTGAC 


ATATCGATAT 


TCTCCAACTC 


TAACAGCTTA 


120 


ATTTTATTAT 


TAATCGTTCC 


ACCGAATCCT 


GTTAAGCTAC 


CCGTTTTACC 


GACAACACGA 


180 


TGACATGGCA 


CGATAATTGA 


TAATGGATTA 


CTTCCGACTG 


CACCTCCAAC 


CGCTTGGGCT 


240 


GACATTTGTG 


GCTTATCAAG 


CACCTTACCT 


ACTTTTGTGG 


CAATAGCACC 


ATACGTTGTT 


300 
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AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA ATGACTACCT 360 

GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCAACCT TTAAAATACG CGTCTTAACC 42 0 

AC TGTGTC GN CTCTCTAAAT ATCGCTAAAG ACGTATTTTC TTCCCTAGTA CCAACACCTT 480 

GTTGATTTTC AAACAAACAG CGGTCAGATT ACCCATCACT CAAAAGTCCA ATCGTCTACA 540 

GCGATCATAG TACTCTATCT CCATAAATCC CCCTTTTCTT ATCATACGCC TATG 594 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

AATGGGAAGT TCAATACCCA TAATGNCAAG CACCCTTGCA TTTAATTGCA TTTTCCNACC 60 

GTTATTACNG GCCCCATCTC CAACAATATG TAAAGTTAAT ATCTGCATAA TCTTTTTTTA 120 

AAGACTTCTT TTGCTGTTAA GAAATCAGCA AGANCTTGAG TAGGATGATC TTCATCAGTT 180 

AAACCATTCC ACACTGGTAC GCCTGAATAT TCAGCTAAAG TTTCTACTGT TCTTTGTGTN 240 

AAACCCGCGG TATTCAATGC CATCATACAT TCCACCAAGC ACACGTGCAG TATCTTTAGT 300 

TGTTTCTTTT TTACCCATTT GTGATCCAGT TGGGCCTAAA TAAGTTACAT TTGCACCTTG 360 

ATCATGCGCT GCAACTTCAA ATGCACATCG CGTTCTTGTA GAATCTTTTT CAAATAACAG 420 

TGCAATATTT TTATTTTTTA AC ATAGGC TT TTCAGTGCCA ATATATTTAG CACGTTTTAA 480 

ATCC TCGGAG AGNGTTAATA ATNNGNAATT C 511 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

so (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTGCCTTTTC TC TATCTG AT AAACCACGCT TTATTAAAAT TAAATCTCCT AACCATACCC 60 

CGTCTAAATT ATCTGGAAGT ACATCAGCCT CTCTTACTTC AATATAATCA TGTTGTATTA 120 

55 AAGTTTCTTC ATATAATCCC ATCTGATACA TCCTTTACTT ACGTTTACTT CTTATATAAT 180 

CTGCATAATC TAAAACTCTT TGGCATTCAT CATCTGTCAA TTCTCCTTC A AGGTGAGCTG 240 



30 



45 
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CACGATGTTG TACTTCGNTT TCTGGTTGTC TATTTNNTAA TAGTAAATAT TCTGGGGTAA 300 

CTTTCAATGC ATTGGC AATT TCAGCTATAT CCTCCATAGG TATTTNTCTG CTACCGNNTN 360 

CGTATCGGGA TAAGGTAGAT TTATTGACAC CTATCTTAGG TGGAAAATCA GGTAAATTCA 420 

CATTATTTNC TTTTCGGAGG TTGGGTGATT AATTTANCTA TTTCCGCTGA AGTNCTCATA 480 

TTCAAATTTA CTCCCGTTTA TTTATACAGT ATAATAACAC TTCCCATATT AGGAAACACT 540 

AGCATT 54 g 

(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



30 



35 



TTATATTATG 


TTCTGTTGGA 


CCCAGTAAAA 


TGTTGCAATG 


TATGATAAAA 


AGCCCTTGGT 


60 


TCAATTAATC 


GAAAATCGAC 


ACCAATCGTT 


TCATCATCAT 


ACGCTAATTC 


TAATGAAACT 


120 


TGTGTGTTCC 


CCACTGCAAC 


TTCTTTTACT 


TTATTGGGAA 


TATTTAATAA 


TTGCTGCTGC 


180 


ACTGCTTGTG 


GGTTATCGGT 


ACTTATTATG 


AAATCTAAAT 


CTTTGCTCAT 


TTCTTTAAAA 


240 


CGACGGAAGC 


TTCCTGCAGA 


TGAATATTGA 


TCGATATAAT 


TTAATGTATC 


TATATAATCA 


300 


ATGATTTCTT 


GATTAAGTCT 


TCTCATTGGA 


TCAATTGGAT 


ATCTATCTTT 


CTTAGCACAG 


360 


NGGGGTTTCA 


CAGCTTCTAA 


TATGGTGTGG 


TCCGNTTTCT 


TTAGCAAATC 


CGCTTAATTC 


420 


ACTTAACTTN 


TCCCAATNTC 


ACAANCAACT 


TGGAAGTGAC 


GCTTNTATCA 


ACAAATATTC 


480 


CAACTCTTTA 


TATAGCTTAG 


CAATTTCTTG 


GTTCCAAGTC 


TGGAANTTTC 


AAAAGTGGAT 


540 


TAAGACCTCC 


CGGACTTCTT 


CCCGGTAATG 


GT 






572 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



50 (ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

55 GGATGCACTC TGTTAGCATT ATCTTTAAAC ATTAAATATG GATAACCTGA TTGTAATTGT 60 

GTNTGCGCAA TTAAATTCAA CATTTCACGC GCATTCTTTT TCTTTTTCTC AACATTTGGG 120 
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TTTGCAACCA 


TGTCATCATA 


ATATTTTTCT 


AAATCGATAT 


CGTCTAATGT 


CACACCATAT 


180 


TCTTCTTTAA 


CTGTATGAGG 


TGCAAACATA 


TAAAAGTCCT 


TACCTTCTTT 


AGCTAAATCG 


240 


AAGAATTTAG 


ATGGAACAAT 


TAAACCAGTT 


GATATTGTAG 


ATAAACG TAA 


ATCTTCATCC 


300 


GCATTTACTT 


NNTTAGTATC 


TAAAAATTCT 


TCTACATCAT 


AATGGAAGAT 


ATTTAAGG T A 


360 


CACAGCACCA 


GCACCAGGGC 


GGTTGGACAA 


GTTGGATCTG 


CATAGCTAAA 


GCCACCTTCA 


420 


AG TGAC TTAG 


CAATAGGTAA 


AACGC CTTNC 


GCTACGCCTT 


TAATTCCTTT 


AATTGCTTCA 


480 


CCACGTGCAC 


GCAATTTAGA 


TAAGTTAATT 


GCAACGCCGC 


CCCCAATTTT 


ACTTAATTGG 


540 


TNGNGCAAGG 


NGGANTCAAT 


AAAGTTAANT 


TGAATTTAAG 


NTGNCAACCC 


AC TTC TTNNA 


600 


TNNGAAACAT 


GACACTAGCT 


CACCACGACG 


CGCACGGNCT 


GCGTTTAAAA 


ATGTTGGTGT 


660 


CGCTGGNTGA 


TATCTTTGTT 


CAACCATAGC 


AGAAATAAAT 


TGTTTAGCTT 


GTGCTTTATT 


720 


ACCATNTGCT 


AGGTATAAAG 


CAACAATGGC 


AACGTGTTGA 


TTATAGTCTT 


CTAAGTATTG 


780 


ACTTTTATCA 


TTTGTTTNCA 


AAGCGTAATC 


TTTGAAAAAT 


TTACTAGCTG 


ACATATAACT 


840 


TGCAAAATTA 


AACGAGATTG 


ATTNTGGATA 


ATCAGTGATT 


TCAATTAGAT 


CCNCTTCACT 


900 


ATAAATATCA 


AACACATTGA 


AATAAAAATC 


GGTGTCTTCT 


AAATAACGTA 


AAACGCTCGA 


960 


TTTCAATATT 


CGAAGAAGAT 


NGCGTGGGCT 


TTTTACTTCT 


TTC TTAAATT 


AAGCTTACNT 


1020 


AAAGGTTCCT 


TGGGGCTTTT . 


TTCTTAAACT 


TAAAGGAATC 


CCATCTTCCT 


CTTCCCGGTT 


1080 


NGGGGACCCC 


AAATATTTTA 


CTCAAATATT 


GATGG 






1115 



(2) INFORMATION FOR SEQ ID NO: 45: 

25 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1123 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



40 


TTCCATGGAG 


GCATGCGCAC 


AATCTGCATT 


GTAATTCATA 


CCAAAATGTT 


CACCGACTCC 


60 


CAACCGAATC 


CACCGAAAAT 


GCCAACATCT 


AGAATCGGTT 


GCACATCATG 


TTCAACACAT 


120 




TCTTCGTGCA 


ATAAATTAGA 


GAGTTCATTA 


TTGATGCCAT 


AACCGATGCT 


AATTG TATCN 


180 




CCATAAGTTA 


AAAACTGAGC 


AGCACGTCGG 


AGAATCAATT 


TCCGACTANT 


AAAAGGTAAT 


240 


45 


GCGGGTTCAG 


GTATTCCATC 


AATTCGTTCT 


TCTCAAGACA 


AGGCTGGTAA 


AT AATG AC TC 


300 




TGAATTACTT 


GGCGGTGATT 


CTTTTCATCT 


TCTGTGACGA 


ATACATAATC 


GACAAGATTT 


360 




CCTGGGATAA 


CAACTTCATT 


CGGTTTTAGG 


TGATAGTCGT 


CAACTAAAGC 


TTTAACTTGN 


420 




ACAATAACTT 


TCCCATGATT 


GGCTTTCGCG 


TTTAATGCGA 


CATGATAACA 


CTCGCTCAAG 


480 


50 


TACGCTTCTT 


GAGNTAAATA 


AATGTTACCT 


TGTTGATCTG 


CGTATGTTCC 


TCTCAGTAGT 


540 




GCCACATCAA 


CGCTAGGGAA 


TGTGTAATGT 


AAGTATGTTT 


CATCGTTGAT 


GGTTACTAAT 


600 




GAAACTAAAT 


CATCAGTTGT 


TCGTGTATTT 


ACTTTACCGC 


CACCGTATCT 


AGGATCAACA 


660 




GCTGTGTTTA 


ATCCGATTTT 


AGTAATAACT 


CCAGGTAATA 


ATTGATTACT 


CTGACGATAA 


720 


55 


TGAGTTGCAA 


TGATACCTTG 


TGGTAAAAAA 


TAAGCTTCAA 


TGTCATTATT 


TTTCATTGCT 


780 




TGTGCCGTTT 


TGGAAGAAGC 


CGTTAAAATG 


CTCATAATGA 


CACGTTTAAT 


CATGCGACGT 


840 
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TCTATAAAAT CATC T AAATC C GGTGCGGC A CCTAAACTAT GGATATCATT CGCTAATATA 900 

AACGTTAAAT CTTTGGGCGT ATGATATGTG TCATGTTGCG CTAACACAGC ACGTAGAACT 960 

TCGGCGGG T A AGTTGGC T AC AGCTAATGCT GGTAAACCAA TCACATCACC ATCTTTAATG 1020 

ATATGTTGTA AGTCGTGCCA TGTGATTTGT TTCAAGCAAG TCACCTCCAT CACATTTGAT 1080 

AAAATATAGC GTTTTTACAC TTTGTGTAAA CCCTTACAAG AAG 1123 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



25 



30 



35 



40 



CTCCATTTGC 


ATTTGGGTCT 


TTAAAACTAA 


ACACATACTT 


AT AGTTGC C A 


TTATGTTCTT 


60 


CAATAGAATA 


ATCATTATAC 


ACTTTATTAT 


TACTACCAAA 


TTTATTTGCT 


TCATTATTAG 


120 


CCGCATTTAA 


AGCTGTTTGG 


AAATTTGGCA 


ATTGCTGTAA 


AGCTTGAGTT 


TTATTTCCAT 


180 


TAAACGGATA 


AATTTGACGT 


GCAACCGGCG 


CGGCATTTTG 


TCCATAATAT 


GGTGCAACGT 


240 


AACTTGATTN 


TTGATTATTA 


CTTGATTGGG 


TTATTATTTG 


TTTGGGTNTT 


GGGTCATTGT 


300 


TTGTTGCATT 


TGGAATTAGA 


TTGTTGCTGG 


TTATCGTTTG 


CACTATTATC 


TTTATTATCT 


360 


TTGNTTACGT 


CTTNACTATC 


ATCTTNATTG 


TCTTTCTTAT 


CTTNAGATGA 


ATCANTTGNT 


420 


TTNTTATCTT 


GTTGNTCAGT 


TCTCGCTTTA 


TCATCTTTTT 


CTNNATTACT 


GTCTTTTTGG 


480 


TGGNGCACTA 


TCTTGACCAC 


ATGCAGCTNA 


AAATAATGAT 


AATGCTAGTG 


GACCCTGTAN 


540 


CTAATCTTTT 


CATACATATC 


TCCTCCTATA 


ATTCGATATT 


CATTGAATAA 


GCTTGAAATA 


600 


CATATCTNCC 


ATGTGTATCA 


TTTCATGGCT 


TGTTACCANA 


GACGTCAACA 


TTAATATGAT 


660 


TTGANTTATC 


ACAATACCCA 


TAACTACAAT 


CTAAGGTTCT 


TTTTATTATA 


CCCTAATTTT 


720 


TGTTCATTAT 


TATTTAATTT 


TTGTGAATTT 


TATGTTTTCT 


ATAAATTTAA 


TTATTTTACT 


780 


TTAACAATTC 


ATTACGCATT 


TAGCATTTCA 


AGGCATACAC 


AATAGGAATT 


CCGATGATTT 


840 


CATTTTATCT 


TGCTGC AAAA 


AACAATCATT 


ATTACTTCTT 


TTTTCCATAA 


TTAAATCTTG 


900 


TATCCCGTTA 


CATCACCTGG 


TNTGGAAATG 


ATGTTCAACC 


CAACCACTTT 


TAAATCCAAG 


960 



45 

(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 1130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 



10 



15 



20 



25 



30 



35 



45 



50 



55 



TTGGCCCTTT 


GGAGANTCCG 


GATTATTGAT 


AAAGATATTC 


CTAGCGTTCG 


CTAAGCATAG 


60 


TTATGTGATG 


GANAATAGTC 


TTGTTGAAGA 


GTTGTTCAAT 


ATTGCCTCGN 


CTGTCGCTCC 


120 


AAGTAATGAT 


AAGCAGGGTG 


TTTTANCGAT 


TATCGAACAA 


GAAGTATTGN 


NCAAGTAGGT 


180 


AGCACATCAT 


CAATGAAGCG 


GCTATCATGA 


TACCTAAATA 


CATGTTNNAT 


GNAANCTTTG 


240 


NATACCGAAT 


CATCNAACAT 


TAATATATCT 


CCAAACAGGT 


TAGCCAATAA 


TCCCATAGTA 


300 


AAATAAGCCC 


AC TAC AATC A 


GTTGAATAGA 


TGATAAAATA 


AAGCAGATTT 


TAATCAANCC 


360 


GATTTTGATA 


CAGATAAACA 


ANATANNAAN 


CAAAGGCACA 


TGGTGCGCCT 


GTGCTTTTAA 


420 


ATATATATTA 


ATTTAAACGT 


GTTTCACATG 


TACCCAAGTT 


GTTAATNGAC 


AAGATAATTG 


480 


CTGCGTTTAA 


ANCCCACCTT 


CCAACAAGGT 


TTTGTACCTG 


CTTCATCAGA 


GAAGAAACCA 


540 


ATATGTGGTG 


TTAC TAAAAT 


TCCTTTCATG 


TTCCGATTAA 


CTCTAATAAT 


GTTTTATCGT 


600 


CAATGTCTTT 


ATTAGTC C AG 


TCATTTGTGA 


AGTATGCAGC 


TTCATTTTCA 


TAAGTATCAA 


660 


TCGCAGCACC 


TAACAAAGTA 


CCATCGTTCA 


CTGCAGCGAN 


TAAATCAGGT 


GTATTGATGA 


720 


CTGCACCACG 


TGCTGCGTTA 


AC T AAGATTG 


CACCTTTTTT 


CACATGATCA 


AACATTGCTT 


780 


TATCGAATAA 


ATGATAGCTT 


TCTTTGTTCG 


CTGGAACATG 


TAAAGAAATA 


ATATCGGCAT 


840 


CTTTAATAGC 


TTCTTTAACA 


CTATCTTTAT 


AAGTTAAAAA 


GTCTAAATCT 


TTATTAGGAT 


900 


AAGCGTCATA 


AGCTGTAATT 


GTAGCACCAA 


ATCC TGC AT A 


TATTTTAGCT 


GTAGCAGCAC 


960 


CGATACGACC 


CGTACCGATA 


ATTGCAACAG 


TCATATTTTT 


AACTGGTTTA 


GACATGATTT 


1020 


CTGCTTGCCA 


AGTAAAATCA 


TGTGTTTGTA 


CACGGCGTTC 


AATATCTGGG 


AAGCGACGCA 


1080 


CTAATTGTAG 


GGCGATAGAA 


ACAGAATACT 


CTGCAGTGTT 


TCAGGGGAAG 




1130 


(2) INFORMATION FOR SEQ 


ID NO: 48: 








(i) SEQUENCE CHARACTERISTICS: 








(A) 


LENGTH: 573 base pairs 








(B) 


TYPE: nucleic acid 










(C) 


STRANDEDNESS : single 










(D) 


TOPOLOGY: 


linear 










(ii) 


MOLECULE TYPE: Genomic DNA 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 48: 






GCAGTATCAA 


TAGTAGTTGG 


TTCATCATTG 


CCCAACACCT 


TTTTCAAATT 


ATCATTTTCA 


60 


AGTTGAAGCG 


CTACATTTTC 


TTCTACTAAT 


TCAACTGCAA 


GTGCCTTTAA 


TTCTGAAGTT 


120 


TCCTTTGAAA 


GTTGATTGAC 


ATTCATTTCT 


AAACGCATTA 


TTTTTTCAAA 


TATTTCATTG 


180 


CGATCCAAAT 


GTAATAATGC 


CTCCTTAATG 


CATAGTTTCT 


ATTTCTTCTA 


ATTTATATTC 


240 


AAGTGGCTGT 


TCATGTCCCT 


CAAGCTTCAC 


CTGCATAGAA 


ATGTCTAATA 


TATTTAAAGC 


300 


AACTACTTTC 


CCGGTTACCA 


TCAGGCGTTT 


CAATTGCTTC 


ACCAATATCA 


GGTAATTGTG 


360 


CACGTACTTC 


CTCATAATAG 


TCATTTTCAT 


ATTTTAAACA 


ACACATCAAG 


ACGACCACAT 


420 


GCAACCAGAA 


ATTTTAGTTG 


GATTTAATGA 


TAAATTTGGG 


TCCCTTAGCC 


ATCTTTAATC 


480 


GATACTGGGT 


TCAAAATCCC 


CCTAGAAAAT 


GTAGGAACAA 


CATAACGACC 


TACCCACAAG 


540 
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GGTCCCGATA CCGCCAAGCA ATTGGGCTTC AAT 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



20 



25 



30 



35 



CCTTAAGAAG 


CACCTTTACC 


AATTCCGCCA 


AAATCTGTAA 


ANTCAAATGT 


AGTAGGTAAT 


60 


GTTTTTTTAG 


GTTGAACCAT 


TTCTTCTAAT 


TTAAGTAGTC 


TTGCATCTGN 


CACTTCNACA 


120 


ATACCCACAT 


TTGGGTCNAT 


AGTTGCGANC 


GGATAGTTTN 


CTCCCAAGGC 


ACCCGCCTTT 


180 


GTAATTGCAT 


TAAATAATGT 


TGATTTACCA 


ACGTTTGGTA 


AGCCAACGAT 


ACCTGCTGTT 


240 


AAAGCCATGA 


ATCATTCTCC 


TATCTTNGTG 


TATCATCCTG 


AGATACGATG 


ATTTTTTTTA 


300 


GTTTTTTATC 


AAACGTTTGA 


CGTGGAATCA 


TAATGCTTCG 


TTGACAATTT 


TCACACTTAA 


360 


TTCTAATGTC 


AGCACCC ATT 


CTAATAATTT 


TAAAACGATT 


GGTTCCACAC 


GCATGTTGTT 


420 


TTTTCATTTC 


TACTATATCA 


TTTATTCCAT 


ATTGNGACGC 


CATTAAATAT 


GACCTCCATG 


480 


TATTATAAAC 


TACTAAACAC 


CGNTTTCACT 


ATGATTAAAA 


GGTGTCATTA 


TTGGNTGTGG 


540 


TGTCTTGATA 


CCTTCTTGTA 


AAAACATTTT 


GTGGACTTCT 


TTACGAATAA 


TACGCGCACC 


600 


AGAGAAACCN 


TCACCTGGTA 


TTGTTTCTGC 


TGAAACCCNT 


AATATTACTC 


TAGTATCTTC 


660 


AAAGGCATCA 


AT AC C AATCA 


CAACAGGATC 


ACTTACAAAT 


AAATNNATAT 


TTACTACGCA 


720 


GAGGATGGTA 


AATAATGTGT 


TTAATTTCTT 


CTCAACATTA 


TCAATATTTT 


CATCTACAGA 


780 


TACTGGAATT 


TTAACAATTG 


CTGGTTCCAC 


TCGGNAATTG 


AGTAANTCGG 


TATTCACCCC 


840 


NTNCTACTAT 


TTGGGAAGAT 


TGGRRDCTNS 








870 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



so (ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

55 TAAGATATTA AGTTATAAAA ATGATTTGAA AAATAAAGTA GAAAACTTAA ACAATTTAAG 60 

TCCAACTAAT ACNATGTTGC GTGGATATGC AATTGTTAAT AAAAAAGACG AAGTCATTAC 120 
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GAG TACT AAA GATTTAACAG AAAATGATCA ATTGACGTTA ACAATGAAAG ATGGCTTAGT 
AGATGCAAAA GTTACGAAAG TAAGGTGTAA TAATGACTAA AGAAACGCAA AGTTTTGAAG 
AAATGATGCA AGAATTAGAG CAAATTGTTC AAAAATTAGA TAATGAAACA GGTATCTTTA 
GAGGAATCAT TAGATTTATA TCAACGTGGG TNTTGAAACT ATCAGCAGCT TGGTGACACA 
ACTTTTAAAA AATGCCGANA AAAAGGNGAA TGACTTAAT A AAAGAAGAAG CTGAGATTGT 
TAAAAATGAC GATCTTCCGA NTGANTTAAT TATAGANGGA NGTCAATATG ATNTCCGGTG 
GCGATAATTA NCCCAGNAAT GGTTCTCNGT NGGAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 


(xi). 


SEQUENCE DESCRIPTION: 


SEQ ID NO:! 


51: 








CCAATCGCNT 


GTNTGACTGT 


NTTGTCACTG 


ANAGCGTTAT 


CTTNTTT AC C 


AGANTTGACA 


60 




AC TNAC ATTN 


TCGTANTCAT 


AGGTTGACTA 


CGCNTAACTT 


GATAGTCACC 


TGTATCTTTC 


120 


30 


AATTGTTTTA 


AAGAGTCTTT 


GNCTAAGCTA 


TCCGTACCTC 


TATCATCTGT 


GAGCCANAGT 


180 




CGTTTCACCT 


TTTTTCATTG 


ATAGGAATGC 


TGTTTCACCA 


GCAGGCATTA 


CTTTTGCTTG 


240 




TACTTTGTTA 


AGTTAAGACT 


TTTCGCCCCA 


GTATTGATCA 


TTTTTGTTAA 


AGTCTGCAGA 


300 




CTCATCTTTT 


TTGTGTTCAC 


CTAATTTAAA 


TGGACCAGTA 


CCATCGAACT 


TTTTAACGCC 


360 


35 


ATCTTTTGTT 


GTACCGTTTT 


TAAAGTCTTT 


TGGAGACACA 


AATACATATG 


GACGAGGCAT 


420 




CGCTAATTCA 


GCCAATGCAG 


GTTGATATGC 


TTCTTTCAAA 


TTCAATTCAA 


CCGTGTACTT 


480 




ATCTTTAACT 


TTAACATTGT 


CAATTAATGT 


TGAAATCTTT 


AACCAAGAAT 


GCAATTTTTT 


540 




ATTTTGTTGA 


NCCGCATCAA 


TATTTTTCTT 


AACTGCGTCA 


GCATCAAATG 


TTGNTCCATC 


600 


40 


ATGGAATCTC 


ACATCATCTC 


TTAAATGGAA 


TTGTAT TAAG 


TCTTCCCATC 


TTCAAGACAC 


660 




ATCCCATTTT 


TTAGC TAGGT 


AAAGGCTTAA 


TACCATCTTT 


CGGTGTTACG 


GTACAAAGCG 


720 




GCTCGTARRD 


CTNS 










734 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



180 
240 
300 
360 
420 
480 
519 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



5 



10 



15 



20 



25 



GNGNTNCTTA 


TNCGATTCAC 


AGTGAAGTAG 


GTGATAAGAT 


GATTGGTCCC 


AAGC TGAAAT 


60 


CCCCAAAATT 


GTGCCAATTG 


ACTATATTTT 


NCAAACAGGC 


GATATTGTTG 


AAATACGTAC 


120 


TAGTAAACAT 


TCATATGGAC 


CAAGTCGTGA 


TTGGTTGAAN 


ATTGTTAAAT 


CGTCTAGTGC 


180 


CAAAGGTAAA 


ATTAAAAGTT 


TCTTCNCAAN 


ACAAGATCGT 


TCATCTAATA 


TTGAAAAAGC 


240 


CCGAATGATG 


GTTGAAGTTG 


AAATAAAAGA 


CCAAGGATTT 


AGAGTCGAAG 


ATATTTTGAC 


300 


AGAGAAAAAT 


ATTCAGGTTG 


TTAATGANAA 


ATATAACTTT 


NCAAATGAAG 


ATGATTTATT 


360 


CGCAGCTGTA 


GGATTTGGCG 


GCGTGACATC 


CTTACAGATT 


GTTAATAAAT 


TAACTGAAAG 


420 


ACAACGTATT 


TTAGATAAAC 


AACGTGCTTT 


AAATGAAGCA 


CAAGAAGTTA 


CGAAATCATT 


480 


GCCTATTAAA 


GACAACATCA 


TTACTGATAG 


TGGTGTCTAT 


GTAGAAGGTT 


TAGAAAATGT 


540 


ACTTATCAAG 


TTGNCAAAAT 


GTTGTAATCC 


TAT AC CAGGT 


GATGATATTG 


TAGGTTATAT 


600 


CACCAAAGGT 


CACGGCACGA 


AAGTACATCG 


CACTGATTGC 


CCAAATATTA 


AGAACGAAAC 


660 


TGAACGACTA 


ATTAATGTTG 


AATGGGTAAA 


ATCNAAAGAC 


GCAACTCAAA 


AATATCAGGT 


720 


TGATTTAGAG 


GTAACTGCGT 


ATGACCGAAA 


TGGCTTGTTG 


AATGAAGTAC 


T AC AAGC TG T 


780 


TAGCTCGACA 


GNCGGCAATT 


TAATTAAAGT 


TTCAGGACGT 


TCAGATATTG 


ATNNNAATNC 


840 


AATAATAAAT 


ATTAGTGTCA 


TGGTGAAAAA 


CGTGAATGAT 


GTTTATCGTG 


TGGTAGAAAA 


900 


GACCAAACAA 


CTTGGTGATG 


TTTATACAGG 


TAACAAGAGT 


TTGGGAACTA 


GAGGGTCCAA 


960 


AAATATTGAA 


AGTAGGTGGT 


ACAAAAGAGG 


TTAAAGAAGC 


ATCGGGTGGA 


CGAATTGATA 


1020 


CATTAAATAA 


TCAAATCAAA 


AAAAGGATAT 


TGGTTTATTA 


GTCGGGGATC 


CGGNCAAGAA 


1080 


CTCTACAAGA 


GCAAAGATGC 


AGATGTAATT 


GCAAAGAAAA 


TTGCTAATGC 


AAGATTATTT 


1140 


GAAGATGACA 


ATAATAAATT 


AAACTTTAAT 


ATCCAACAAA 


TGAATGGTGA 


AATACTATCA 


1200 


GTTTCACAAT 


TTACTCTCG 










1219 



30 

(2) INFORMATION FOR SEQ ID NO: 53: 
<i) SEQUENCE CHARACTERISTICS: 

35 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 





AC T AAAATG A 


TTAAATTACT 


TATCTCTTTT 


CATCTGATTG 


ATTATCGAAA 


TTTCTTCCTT 


60 




CTAAACCTGC 


TAACTC TTCT 


TTAGAAGCTG 


NAGGTGCTTT 


CATTTCAAAT 


ATCTCATTCA 


120 


50 


CTACTGTGTA 


ATCGGAATAT 


CCTAATCTGG 


CAATAGGTTT 


AATCGACTTA 


ATGTCCAATT 


180 




TACCATTATC 


AAGAATAACC 


NTATCGGCAA 


TATGAACTTG 


GGCAACTCTT 


CCTATAACAA 


240 




TATCTACGGT 


AGATACTGGG 


TCTCCTGGCG 


GAATACGAAT 


CGGGTGAACG 


GTACTCACAT 


300 




NCAAAATGTA 


CTGGCGATCT 


TNGACTCGAT 


AACCNGGGGG 


CTTCTATACA 


ATGNTNCTTN 


360 


55 


CGGGCACAAC 


CGGNATATTN 


AAAATNCAAC 


CCCCCNCAGG 


TGGGTAGGTG 


CCTTCCGACG 


420 




GATAAATTAA 


CAGGCTTCTC 


TTTAAATCAT 


ACGGTGGCCA 


TGTNCCAATA 


CAAACCCACC 


480 



61 



10 



15 



20 



30 



40 



SO 
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CCGGTTTCTT CCTGGAAT 498 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



CTGTCAATGA 


ATATGATGAT 


ACTGATAAAA 


TTTTCTCTAA 


CCCATCAAAC 


AAGAAAACAG 


60 


AAGATTATAT 


TTCAGGAAGG 


TTTGGTTGAT 


ATATAATGGC 


AATAATTAGA 


CAGCGATATC 


120 


AGGAGCAACT 


TGATGATTTA 


ATAAAGGAAT 


TACGTCGGTT 


AGGTGCAAAT 


GTCTATGTGA 


180 


GTATTGAAAA 


TGGTATAAAA 


TCATTAAGTA 


TTGACGATAG 


AGGCTTTGNA 


CGACAAACAG 


240 


TTAAAAACGA 


TAAACATATC 


AATCAATTAA 


ATTATGATAT 


TAATGAGCGA 


GTTATCATGT 


300 


TAATTACAAA 


GCAACAGNCC 


ATTGCGAGTG 


ATTTGCGTAT 


GATGATTTCT 


TCAANTAAAA 


360 


ATCGNCTCCG 


ATTTAGAAAG 


AATAGGAGAT 


AATGCATCGA 


GTATTGCCAA 


TATTCGATTG 


420 


CGTACAAAGA 


TTACAGATGA 


TTATGTGTTA 


ACCCGTTTAA 


AGACAATGGG 


TAAATTAGCT 


480 


ATGTTAATGC 


TAAAGGACTT 


AGATCAAGCA 


TTTAAAAAGA 


AAGATACCGT 


ATTAATAAGA 


540 


GAAATAATTG 


AGCGTGATGA 


AGATATCGAT 


GACTTATATA 


GTCATATTAT 


TAACGCAACG 


600 


TATCTTATTG 


ATAACGATCC 


ATTTGTCGCT 


GCACAAGCTC 


ATTTAGCAGC 


AAGACATTTA 


660 


GAACGTATTG 


GTGATCATAT 


TATTAACATC 


GCTGAAAGTG 


TTTATTTTTA 


TTTAACAGGT 


720 


ACACATTACG 


AACAAATAAC 


TTAAAGTTAT 


TACTATAAAA 


TCCCTTACGA 


TAAATATATA 


780 


TTTCTATCAT 


TCATAAACCC 


TCAAAAAAAA 


CCAAGATTCT 


CACAAATTAG 


NAATGNGTGA 


840 


AAANCTTNGN 


GNTATATTTT 


GGTTCTTACT 


TATTAAATNG 


GTCTCGCATC 


TTAGGNTATT 


900 


TGGNTTGGTC 


AATTTCATCT 


TG 








922 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CCAATCGGTC TTATCTTTCA ACACGTTTGA TTGTANCGGA TATCACCCGG TTCATAAATC 60 
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CNAAATCGTT 


G T AAT ATATT 


TCTCTTTATA 


ITLA11A1 J. I 


v_ 1 AVjWjAA i V-C 




120 




TGCGGGCAAG 


TTTCTGGGAA 


ATTTAACAGC 




1 I v_ AAV_ AAA X 




180 


c 


TTCGATGTTC 


TAAAATTTCT 


GCTTC 1 1 vjA X 




TfAATTATAT 


TTTAATATTT 


240 




CTATTAAAAA 


TAATTCTAAT 


AGTCGATG I C 


1\_ i 1 AA 1 1 


nl l_AAljt_tj X A 


Tf2T'<7T r ^ A A AC" 

X Vj\_ O I i /Vrt-TV V— 


J U U 




CATCCTCTGT 


TAATCTAACA 


CCTTTGTATG 


GTTTTGTTTC 


71 ITS TirTTT 1 
nALA J. Alji_ L, i 


1111 l\_rtrt 


360 




GACGTCCTAC 


CATTTCACTT 


ACAGATGGAG 


GC TT AAT AT T 


mjj jaji TTV*" a 
lAAAAAX ivjA 


uH 1 Artkjrt X 1 1 


42 0 


10 


TATTTGTCAC 


AAAGTTTTTA 


TCGCCATTAA 


TTCGTAAGGA 


I 1 X i 1 A 


AAA 1 AAu 1 V— v~ 


/on 

O U 




TCTTTTTCTT 


CAGNTAACAT 


ACTTTCACCT 


CAACAGAt- A I 


I Vj iLiAlAl X 


7\ <T» TV TV"* A PP 71 A 






TTTACTTGAC 


ATGATAAATA 


TTCTCAGTTT 


ATTANACAAT 


TAATTAGGTT 


AGCCTAAACT 


600 




TTTAATTAGG 


AGGTATAAAC 


GTTTGTTAGA 


AACANAAGAT 


TTAAATCTGT 


TTTTAGGTAA 


C C f\ 


15 


TAAGCATGTA 


CTTAAAAACA 


TTTCCTTATC 


GATACCAGCA 


CGAGCGGCGA 


AATAATTGGT 


720 




ATCATGGGCC 


CGAATGGTGC 


TGGTAAATCT 


TCCCTTATCA 


AGTCTTTAAT 


TGGTGAATTT 


780 




AATGCTACCG 


GTACTAAATN 


GTTNTATAAC 


AAACCTATAC 


AACAACAACN 


GC AAC AG ATT 


840 




ACATATATTC 


CACAAAAAGC 


ACATATTGAT 


TTAGATTTTC 


CTATAAGTGT 


GGAACAAGTG 


900 


20 


ATTTTATCAG 


GTTGCTACAA 


AGAAATTGGA 


TGGTTTAGAC 


GACCTAATAA 


ATCAGCAAGG 


960 




GATTAAACTC 


AAACAGTTAT 


TAAGCGATTT 


AGAATTAGAA 


TCTTTTACGT 


CATCGACAAA 


1020 




ATTTCAAGAA 


TTAAGTGGGT 


GGGACAAATT 


ACAACGGTGG 


TGCTTANGTA 


GGAAAGAGCA 


1080 




TTGGATGGTC 


CCGAAAAGTG 


AAGGTTTATT 


TTCTTTAGAT 


NGAGCCCGGT 


TGGTCGGGAA 


1140 


25 


TTTGGATTTT 
AACAAAGGA 


AGGGTAGCGA 


AAAAATTAAT 


CAATGACAAA 


AAATCCGAGA 


ACTTTAAAAC 


1200 
1209 



(2) INFORMATION FOR SEQ ID NO: 56: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 





GAATTCATNT 


TATCTATCGG 


NAATCATTNG 


TAATACATCT 


AC T AATT AT A 


GTGTTTTATT 


60 


45 


CGATAATATA 


AC TGC ATGT A 


CATCTANAGA 


AGCCTCTTGC 


CTNGAAATCG 


CACCGTCAAA 


120 




C TTAAAACT A 


TAACCTATTA 


AATCTGCTTG 


NGTGNCATTA 


CCACTCACAG 


NACAGCGATT 


180 




AAATGATGCA 


GTAGCAACCA AATCGTTTCA 


TGCACTTNCA 


CATAANTACG 


ACTGACATTA 


240 




NGCTGTGTAG 


ATGTACCTGC 


TTCTANCTCG 


CCAAGAACAT 


CTTGGANTNC 


GNGNTNGCTA 


300 


50 


TTTGGNGCAA 
TAACTGCTT 


TGTCANCTTC 


ATCCATNGAG 


CGCTCAATAG 


TGTCAGCTGC 


CAGAGCGGCA 


360 
369 



(2) INFORMATION FOR SEQ ID NO: 57: 

55 

(i) SEQUENCE CHARACTERISTICS: 
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10 



(A) LENGTH: 268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



CCTTATTATA AATTAAAGAA AGGTAGTGGT TTC TATGAAA TACAATACTA ATGTTAAACA 60 

75 TACAACTTTA GAAGCGTTTG TCACAACTGT CAATGATTTG GGTATTGAAT TAATTATCAA 120 

TGAAGCACTT CGAGAGGTAA GAAAACGACA GCTCATAGAA CTTATAGATG ACGCACTCGT 180 

CAATAAAGAT GAAGCAGCAT TTAATCAATA TACGGCAGAA TACAAAAATT TGGAGGCATT 240 

TCTCGGTGNA TAACATTGAT TCGAATTC 2 68 

20 

(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS : 

25 (A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



40 



45 



CTATACAACT 


AAAGTAATTC 


ACAATTCAAA 


CTTTTGAAAG 


GGTGTACAAA 


GTGAAAGTAG 


60 


CAGTCATTAT 


GGGCAGTTCT 


TCCGATTGGA 


AAATTATGCA 


AGAGAGTTGT 


AACATGTTGG 


120 


ATTATTTGGA 


AATTCCGTAC 


GAAAAACAAG 


TAGTATCCGC 


ACATCGTACG 


CCAAAAATGA 


180 


TGGTTCAATT 


TGCTTCTGAA 


GC GAG AG AAA 


GAGGTATAAA 


CATTATCATT 


GCAGGCGCTG 


240 


GCGGTGCGGN 


ACATTTACCA 


GGTATGGTTG 


CATCATTAAC 


GACGC TACCA 


GTTATTGGTG 


300 


TGCCGATTGA 


AACANAAAAG 


TTTAAAGGGT 


ATTAGATTCT 


TTTATTATCA 


ATTGCTCAAA 


360 


ATGNCAGGAG 


GGTATTCCNG 


NTGGAACGAC 


TTGCAANTGG 


GTGCAGNAGG 


GTGCTTAAAA 


420 


ACGCAGGTAT 


TCTTGGCAGG 


AAGAATGTTA 


AGTNTTCAAA 


ANTCCTCTTT 


TGGNNGGANA 


480 


ANTTAATCAA 


GNTTGATCTT 


CCGNAATCAA 


AA 






512 



(2) INFORMATION FOR SEQ ID NO: 59: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 





GAATTCCCCA 


AGTTCCTCGC 


ATTTAATTGA 


ACTTTATAAT 


ATTTGTAGTC 


ATCCACAAGC 


60 


10 


AAATATATAC 


ACTCCCAAAG 


GTTTACATAA 


TTCCAAATTA 


GACACACAAC 


GGTGGATTGA 


120 




AAAATGGCAA 


AACC ATTGGC 


AACAATATCA 


ATTTGGTTAC 


TTTGTATTGG 


TGAAAAAAAT 


180 




AGATTGTAGT 


GNTATTGGTA 


TTTGTGGATA 


TGAATATCGA 


CAATTAAAGC 


AAGAAACAGN 


240 




ACTAANTTTA 


TTTNATAAAT 


TACATCCAAG 


TTTTGGNGGA 


CAAGGGGTAC 


GCATGTGAGG 


300 


15 


CTNTTACAGC 


AATCACAAAT 


T 








321 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : ' SEQ ID NO: 60: 





GGCACGACTC 


GTGCCGCTCC 


TGCACAAGAT 


GTAGGAGCAT 


TTTTATATTT 


AAATAAC T AG 


60 




AGTAATTAAC 


GTAAAGGCGT 


GTGATACAGT 


GAAAACAATT 


GATTAAATTA 


ACACCGAAGC 


120 


35 


AAGAAAAGTT 


TGTGCTAGGA 


CTCATAGAGG 


GCAAGAGCCA 


ACGGAAAGCA 


TATATTGACG 


180 




CAGGGTATTC 


GACTAAAGGT 


AAAAGTGATA 


ATTATATTGA 


TAGCCGAGCT 


TTTGAGTTGA 


240 




GTAAGAATAG 


TGCGGGTTTA 


GATAGGTATG 


AAGAATTGCG 


TCAAGAAGCA 


GGTTGAACAA 


300 




TCAAAATGGA 


CACGCCAAAA 


GGGTTTTGAA 


GAATATGAGT 


GGGTAAAGAA 


TGTNGCTAAG 


360 


40 


AATTGACATT 


GNAATAGAGG 


GGAGGTAAAG 


GAAAGGGACA 


GGTTGATGCA 


TTCCCCCCTA 


420 




GT 












422 



(2) INFORMATION FOR SEQ ID NO: 61: 

45 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 395 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
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GGCCCGAGCG GGACTTAAAA AGGCCACCAC TGGTTGTGAC CTANCCTTAT TTACNTTTAT 60 

5 AAATATAAGG AGGAGGTAGT AGTGAAAGAC TTATTGCAAG CACAGCAAAA GCTTATACCG 120 

GATCTCATAG ATAAAATGTA TAAACGTTTC TCTATTCTTA CTACTATCTC AAAAAATCAG 180 

CCTGTCGGAC GTCGAAGTTT AAGCGAACAT ATGGATATGA CTGAACGTGT ACTGCGCTCT 240 

GAAACAGATA TGCTTAAGAA ACAAGATGTG ATAAAAGTTA AGCCTACCGG AATGGAAATT 300 

10 ACAGCTGAAG GTGAGCAACT GATTTCGCAA TTGAAAGGTT ACTGNGATAT CTATGGAGAT 3 60 

GATAATCGTC TTGTCAAGAA GGGTATTTAA AGAAT 395 

(2) INFORMATION FOR SEQ ID NO: 62: 

15 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1493 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 





CTGTGATTTA 


TCATTCGATT 


GCATGATTAG 


AGGGAGGGAT 


TAAACGTGAC 


ATATCATGAG 


60 


30 


CGTGTTTTAG 


CATTAAGAGC 


AGANAGTAAN 


AGNNCCGCAT 


TTGATTTTCG 


ATTCGGNAGA 


120 




TTTATTTACC 


NAAGAAGAAT 


GGCTAAGTAT 


GTCTCTTGCA 


GAAAGACNCA 


AAGCTGANAA 


180 




AGCATTTCGA 


CACGNAGTTA 


AAAATATGGA 


CGATGTAAGA 


ATGCCCTTCT 


CAAGTGTCCA 


240 




TGACGCCCCA 


AGTAAAATTA 


TATAATGTTG 


TATATTCTGA 


TANCGGCATT 


AAACGTAATT 


300 


35 


TTAAACNAGC 


TGAAAATGAA 


GGATTCTAAT 


ATCATTTCGT 


TTATATATAG 


CAGACATGAT 


360 




AGAATTTTAT 


ATGTAAATCT 


TGTAGGTAAT 


CGTTTTNNAA 


ATAATATAAG 


TATGACTAAA 


420 




NCCACATCCA 


ATATAGGACG 


GGGCTTTTAG 


TATCGTTCCA 


GATTTGTGGG 


GAGTTAGGGT 


480 




TT AC TTAATT 


AAAGGGGTCT 


GCCCTCAATT 


GCTCACCAAC 


GAGGGCAAGT 


TACATCAATC 


540 


40 


AC AC TTCAAT 


TGCCGCCGAC 


TAGGGGTAGT 


AATCATTGGC 


AATAAGAACT 


AGTTAGTGAC 


600 




TAACGGATTT 


ACGTTCCATA 


AGCAAAGTGA 


TACAAACGCT 


CAGCATCAAT 


GTAAGGCATA 


660 




GAATCAATAG 


TTAGGTAACC 


ATCTTTGATG 


TCACCAATTA 


ATCCTTGATT 


TACACCAGTG 


720 




TTCACGTAAA 


TATCATAGCT 


ACTTACGTCT 


GAGAAAATAA 


GCGCTTCAAT 


CTCTTTTCTC 


780 


45 


ATAATCATCA 


CTCCAATGTT 


TATATATTAT 


TTATATAAAC 


TCTCTCGTCT 


CTCTCTATTT 


840 




ATTAACTTCT 


TACAAGTCTA 


ATATTACATG 


AATTTCCAAA 


TAAATAAAGA 


GGTTTGTGAT 


900 




GTATTTCACA 


AATTTATCAC 


TATTTTGGAA 


AATTAAGAAA 


TAGTTAATTA 


TATAAAAGTT 


960 




TAATAAGTCA 


GAAAATATGA 


TAAAATGTAG 


ATGTTCTTTA 


GACATTAAAA GCTTCTAACA 


1020 


50 


TGATACGGAA 


TATGAGGTTC 


CTGTATCACA 


TTAGAAGCTT 


TTTTGTTGCG 


GTGCTTATAT 


1080 




TAATAATGGC 


ATGAATGAAC 


GCATGCTGTC 


TTAGTTTCAG 


GAATCGGCTA 


GAATATTAAA 


1140 




CTTCCGTTCT 


TTGAATATTT 


NTTAATTTCT 


TACTAATATC 


ATCCAGTTCA 




1200 




TTTTAACCCA 


ATC TCCTTGA 


GCTACATTAT 


CAAATTCACC 


TTTAAATTTT 


TTGANATGGC 


1260 


55 


TACCTATATT 


TCACAGCTCA 


CTTTAGATAT 


CGAAC TTGTT 


TCTTCAGAGT 


TAC TTGTT AT 


1320 




ATTACCAGGT 


TTTATTTGAT 


ATTCAGTTTG 


ACTCATAACT 


TGTATCCTCA 


CTTGAAGTAT 


1380 
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AGTGTACAGA 


ATCTGTAAAA 


TAAATGTTTT 


CTTCTTTATT 


TTCTACAAAG 


GTAAATTCTA 


1440 


5 


GATTTTTTGA 


CCCTACAGAT 


GAGCCTTTTA 


AATCACCATC 


NCCTTTTATT 


AAG 


1493 




(2) INFORMATION FOR SEQ 


ID NO: 63: 










<i) SEQUENCE CHARACTERISTICS: 








10 


(A) 


LENGTH: 1215 base pairs 










(B) 


TYPE : nucleic acid 












(C) 


STRANDEDNESS : single 










15 


(D) 


TOPOLOGY: 


linear 










(ii) 


MOLECULE TYPE: Genomic DNA 








20 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 63 : 








TGATGTGATC 


CAACCAAGTG 


ATAGCAATGT 


ATATGCAGTT 


GGAGAATGTG 


CAGAACATAC 


en 
bU 




CGCCAAAGTT 


TATGGACTCG 


TTGCACCACT 


ATATGAACAA 


GGTAAAGTAA 


AGCTGATCAT 


1 in 
LZ U 




TTAACAAATA 


AAGAAACGAA 


CGGATACAAG 


GGATCAACAA 


CAAATTACGT 


CATTAAAAGT 




25 


TTCTGGTTGT 


GACTTGTATA 


GTGCTGGTCA 


AAGTTGTAGA 


AAATGCAGAA 


ATTAAAGGTA 






TTGAAATATT 


TAATAGTGTT 


GATAATAACA 


ATAAAAAAAT 


CTTTTTAAAA 


GACGGTAATG 


inn 




TAGTTGGTGC 


AGTATCGTGC 


CAATATGGTG 


ATATCGATGA 


TGGTTCACGC 


TTTTA I AALA 


JDU 




TGATGAAAAA AGGTGAATCC 


ACTGAAGATT 


ACACACTTGT 


ATCATTGCTT 


ACTAAAV»vj i\f 


*4 Z U 


30 


GAGAAGAGGC 


ATCGCTATCA 


ATTGCTGATA 


TGGCTGATGA 


TGAAACAATT 


TG IXsO J. I vj I A 






ATGGTGTTGA 


TAAAGGTACT 


ATAGTAAATG 


CGATTACGGA 


AAATGGCTTT 


ACAACAGTTG 


34U 




AAGAAGTAAC 


GGCTAAAACA 


AAAGCGGGGA 


ATTCATGTGG 


TAAATGTAAA 


tubCAAAi I\» 


finn 

OwV 




CTCAAATATT 


GCAGCACACC 


TTAGGAGATG 


ACTTTGTTGC 


CGCAAAACCT 


GCTGGTATAT 


CCA 


35 


GTGGTTGTAC 


TGATTTGACA 


CGCGATCAAA 


TTGTAACGCA 


AATAAGAGCG 


AAAGGTTTAA 


720 




AAACATCTAA 


AGAAGTTCGA 


CATGTTTTAA 


ACTTTAAAAA 


TAAAGGTGGA 


TGTCCAAAAT 


780 




GTCGACCAGC 


AATCAACTAT 


TATTTAAACA 


TGGTTTATCC 


ACATGATCAT 


GAAGATGAAA 


840 




GAGAATCAAG 


ATTTGCTAAC 


GAACGTTACC 


ATGCGAATAT 


TCAAAATGAT 


GGTACATTTT 


900 


40 


CTGTTATACC 


TCAAATGCGT 


GGAGGTGTTA 


CAGATGCAGA 


CCAACTGATT 


CGTCTAGGAG 


960 




AAGTGGCTAA 


GAAATATCAT 


GTGCCACTAG 


TTAAAGTGAC 


AGGTTCGCAA 


CGTGTTGGTT 


1020 




TGTATGGAGG 


TTAAAAAAGA 


AGAATTACCA 


AATATATGGG 


AAGACTAGGT 


ATGCGTTCAG 


1080 




CATCAGCTTA 


TGGTAAGAAA 


ACACGCTCAA 


GTTAAAAGCT 


GTGTTGGGTA 


AAGAGTTTGG 


1140 


45 


GCGATTGGGT 


ACGCAATACA 


ACGACAACGA 


CTTGGGATCC 


GTTTAGAAAA 


ACATTTGGGT 


1200 




TCAATCGATC 


AACCC 










1215 



(2) INFORMATION FOR SEQ ID NO: 64: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



5 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: i 


54: 








TTACCTCCAA 


AATCCTTTTA 


AATCCATGCC 


CATAGAAGAT 


GANNC C C AAA 


ATACATTANC 


60 


10 


TAAATCGAAA 


CATTGCCATC 


CATAACTACC 


GTCGAAATCC 


CATCCTCTAT 


TCTCTAATGT 


120 




TCTCAAATAA 


NTTAATGTAC 


TGCTATTACT 


ATTACTTTTA 


TTATTTGAAG 


ACACTGTTTN 


180 




TGGTNTTGGT 


TCT ACT AATG 


GTGTCATTGG 


CACTTTTAAN 


TNTTGACCAA 


NAAATANTAA 


240 




ATTAGGATTT 


GCTATATTAT 


TTGTATTTTG 


AATATTTGAA 


ACTGAAGTTT 


NGTACTTTTA 


300 


15 


ATGCNATAGC 


ACTAAGTGTG 


TCTCCTTTTT 


TTACAGTATA 


GAGTTGTGTT 


TTTGGAGCTT 


360 




CTCTAAACCT 


GTAGTAACCA 


AAGTAATTAG 


TAAGTAACNT 


TAATCGTTTT 


CTTCTGATAA 


420 




GAATNTTTGA 


GCTTCCAAGT 


TTGCAATNTT 


AANTTCTTNA 


GTAGGTNCAT 


TGTTANTTAA 


480 




TAAC TAG ATT 


GTTACCNTGG 


CTTGAGTTTT 


TCGGAGTATT 


TGAAATCTTT 


ATATCTTGAT 


540 


20 


TAATTTCATT 


TCCGTTTGAA 


ATTGCTGATT 


TGTTGTCTAA 


CTTTAAACTT 


GTGTCCGATG 


600 


TTTTAACAGC 


ACCTTCATTT 


TTTATTTTGT 


CTTTTGTCGT 


ATTTTTATTA 


GCATTTAACT 


660 




CTGATTTCGC 


TCGAATACAT 


TTTGCTCATA 


CCCTCTTGTA 


AAATCTTTAG 


ATTTATCAAT 


720 




TTCATCTGCA 


TATGCTTNGT 


TCGACATACC 


CAATGCCAAG 






760 



25 

(2) INFORMATION FOR SEQ ID NO: 65: 



(i> SEQUENCE CHARACTERISTICS: 
3Q (A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

40 



45 



SO 



55 



ATATGCTGAA 


AATGTATCAT 


CAGGATTCAA 


GATATCAATT 


AAATGATGTG 


GAATACCATC 


60 


CATTTCTTCA 


GGTGTTACTT 


GTGCAGTTCC 


AATATTCATA 


TGTTTGTAAA 


CTTGCATAGA 


120 


ATCACCGCTT 


ATGATTTCAC 


CATTGATACG 


CTTCGCGAGT 


TCTATGCTAA GCTCTGTTTT 


180 


ACCTG AAGC A 


GTTGGCCCCA 


CAATTACTAC 


AATAAAAGGC 


TTATTTTTAT 


TCATATTTAA 


240 


TTCCACTTTC 


TTTATTTTTT 


ATAAAATTGA 


TTCAACCATT 


TAAACAAGTC 


TTCCCAAATT 


300 


GTGTCATGAT 


CTTTTTTCAA 


ATAACACTTC 


ATGACGTTTA 


TGATGATAAA 


GATTCACTTT 


360 


TAACAATTTT 


TAATGCCAGC 


TTCTCTATAT 


ATATTCGCCA 


ATTTTAAAAT 


CCCTTTACCA 


420 


TAATCACCTA 


AAGAATCGGC 


ATATCCTGAA 


ACTAATAATA 


TAGGCATGGC 


ATGATTTAAC 


480 


ATTTTCATAT 


TTTTTAATTG 


TGATGTATGT 


AGCATATAAT 


AGGCTGTGTG 


ATATAATAAT 


540 


TGATNTGACA 


CATTAAATCC 


AC T ATATGGA 


TCTTTAATAA 


AGTTATCTAC 


TTCAATTGGA 


600 


TTACTAGAAA 


TCCAATCACT 


NTGTGTACGT 


AATGGACGTA 


TTTTTTTATT 


AAAACTATNT 


660 


GATACTAACT 


GGTTAACCCA 


TNCAACTCGT 


TTCTCAGCAC 


CATAAATTTG 


TGTAATCAGT 


720 


TGTAACACTT 


TAACGGTTGG 


GTAAACCTTT 


CCATAATTGA 


ATACATACCA 


AGGTCACCAC 


780 
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TTTAAAATTA GACCATCAAC ATATTTGCGG ATATTGGTTT CTTACAAAAC AATCTTAGTT 
ATACGG 



840 
846 



(2) INFORMATION FOR SEQ ID NO: 66: 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



25 



35 



40 



45 



55 



AAGNTGAATT 
TATTTCGTTG 
NAGATTTTCC 
TCCACCACAT 
TATTATGTTA 
AAAAGAAAAC 
TAAGCATAAT 
AANTCCTATG 
TGTTGCCTAT 
AAAGATTAAG 
NNAAGAACTA 
CATGTTTTTG 
TTAAAAGTGA 
GGAGCGTATT 
GCATTTCACA 
TGAAANCAAA 
TGGTGCTATG 
ATCACCTGAC 
TTTTTGATTT 
ATC AATCACT 
AACCCCATTA 
AACAATGGCG 
TTTATAANNN 
ATTTTTTAAA 
AGAATTTTAA 
AATGTAAAGG 
TCTGTAATGA 
GTGCATTGAT 
NCCANAANAA 



TTTAGTACAA 

ATCTTATATA 

TTTGTAAGAG 

TTTAGGACCT 

ATCATANTAA 

CAAAAATCAA 

ACTATAGAAC 

GGAAGTGCTG 

GCATCACCTG 

TGAGTTACTA 

AATANNAAGT 

TAATTATTGG 

AATATGACGA 

ACACTCTATC 

AGTTTAAAAA 

AAAGATAATT 

ATAGAAAGTG 

GAAATCAAAG 

AATCTTCCAA 

TCATCATCAC 

ATGTAAATGA 

AGGACGTGAT 

AGGGTAGAGC 

TAT AAGCGC T 

ATTCTATCCA 

AGAGATTGCA 

GCCAAGGTGA 

TAAAGCTATT 

AGAAGTAAAG 



TTTCCACTGT 

AACACTTTTG 

TGTGTTTTGT 

GGTGANAGAT 

TTAGTGTTGC 

TTTATTATAA 

CGAACACAAC 

TGATTGAAGG 

AAAATAACTT 

AAACCAGCGC 

AGAGTCACTA 

TGTCATTTTA 

AAAAGAAAAA 

TCCAGCATAA 

GAGGACCCAT 

TTACTGCTTA 

AAAGATTAAG 

AAGAATTAGA 

TAATCTATGT 

CAATACTTCC 

AATAATAGTT 

TTATTGACNA 

AAGGTATAAT 

AAATGTTTTA 

ATATTGTTCG 

NATGCCGTAT 

AAAGTTTATT 

AATCTCCAAC 

GAANTGGGTG 



ATTTATCTGT 

GCTGC TTTTC 

AGTATCATAA 

GTCGNAAAAG 

AAGTATCNAN 

AGAACAACAA 

CAANTCTGNA 

TTACATAAAT 

TCAATTTGTA 

ATCAATTNNA 

ANGTTAGGAG 

TGTATATGTA 

CCAAAAAGCA 

TACCAAAGAA 

GGGCGATGCC 

TGCTACACCA 

TGAGTTACTA 

CACCNAAGAA 

CATTGCTATC 

CCAGTTTTGT 

TAGCCATTAT 

CTATCAAATT 

ATCCTTTCAA 

ACTAAAGCAT 

AAGTGTAGTA 

AATTACAAGA 

CATCCAATCC 

AAATTAACGG 

TAACACAACT 



TANCCACTTT 
CTTGACCNAG 
ATATGTAANC 
AANATTTTAA 
TTTAAAATGA 
NCGCGCATTA 
CATTTCNCNA 
GANAATAAAG 
GGTGATTTAN 
AC ATCNC C AG 
TTACTTTAAT 
TAGTTGCATC 
AAATACTATA 
CCCAATACCA 
GTAATTGAAG 
GAACATAATT 
AAACCAGCGC 
GGCCACTAGG 
GAAGGTGTTT 
ACAGTACATT 
AAAAACATTA 
TTCTCANATG 
TCGGTTTTCA 
AGATTGACAA 
TCACTGGANT 
AACACAATGG 
ACCGGTGAAT 
TTGTCATTAT 
GANNATTGAT 



ATACCCGCAA 
GNTAANATTA 
TGTATNAAAN 
TATTANCTAG 
AATATGACGA 
CACTTTATCT 
AATTGGAANC 
AAGATGATTT 
TTNNAAAGTG 
ATGATATNNN 
GAACAAAAAA 
AGTCATTTAT 
AAGAACAACA 
TCAAATCTGT 
GCTACATCAA 
ATC AGTTTGG 
ATCAATTAAA 
GTC TTCTTTA 
CGCAATTAAT 
AACACAAGCG 
TATACCGAAT 
TGTGTGATTT 
TATTTTATAT 
GATGTTATAC 
GG T ATTAAAC 
AGAGTTAATG 
GATGAACTTA 
TGTGTTGATA 
GAAGTCTTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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5 



10 



15 



20 



NNTTTAGACA 


TTTAGANTTA 


TTTACTTGAT 


CAAGAAAAAG 


TGACGCTTGA 


AGTTGCAGAA 


1800 


ATGNTAAATT 


CAANCAAAGA 


CNTTAAGAAG 


TTTGAAATTA 


TTGACCGGCN 


AAAATCATTT 


1860 


TATGATGAAG 


AACAANTTAT 


TGATCTTGTC 


TTTGTTGTAA 


ACCAAATTAA 


CGGTTGGAAC 


1920 


AGACTAAATA 


TTATTAGTGA 


TAGACTATAA 


TTGTTCATAT 


AAATGCAGAG 


TTTCATCTCG 


1980 


AACGCTATAT 


CATAACAANT 


C ATGC C ACTA 


TACAGGTCAA 


ATCTTGTATA 


GTGGCATTTT 


2040 


TAATTTATCC 


CTTTGAATAC 


TGTTATTTAA 


CGAATATCGG 


TCCACC TGGT 


CCAACAGGGA 


2100 


TACCTAATAA 


GAACCAAATG 


ATGACAAATA 


CTGTCCATAC 


AATGCTTAGC 


GCGATTGAAT 


2160 


ACGGCATTAA 


AC T AG AAAGT 


AAGGC TCCG A 


GTTTCATGCG 


TTTATCGTAT 


TTTTGTGCAT 


2220 


AAGTTAATAA 


TAAAGGTAAG 


TACGGCATCA 


TCGGTGTAAT 


TGGATTGGTA 


ATTGAATCGC 


2280 


CTACACGGTA 


AATGACTTGC 


GTAAATGCGG 


GATGAAAGCC 


GATAAGGATT 


AACATTGGTA 


2340 


CGAATATCGG 


TCCTAAAATA 


CCCCATTTAG 


CCGATGCGCT 


TCCGATTAAC 


ATGTTGACCA 


2400 


TTGCACTCAG 


TACAATAATA 


CCTAGTATCA 


ATACAATACC 


GTTTTGATGT 


TCTAATAATT 


2460 


TGGCACCTTT 


AACAGCAGCG 


ATAATTCCTA 


AATTACTCCA 


CTTTAAATAC 


GCAAGTAGCT 


2520 


GTGCTGCAAA 


AAACACAATA 


ACGATAAATG 


TTCCCATTGA 


TCCTACAGCA 


TCGC CGAAC A 


2580 


TTTTACCTAA 




TTTTTAATTT 


CTTTGCTTAA 


AATCCCATAA 


ACTAATCCAG 


2640 


G T ACT AAAAA 


TACGACAAGA 


ATAATTAATC 


CGACACCGTT 


AATTAATGGC 


GCATCGTCTA 


2700 


GTAAGCTGCC 


TGTTTTAGCA 


TTTCTTAAAA 


AACTATGTTC 


AGGAATAGCT 


GTAATAATTA 


2760 


ATAAAATAAT 


TGTGACTATG 


AAACTGATAT 


TTGCCCATTT 


TAAAGCATGT 


GCTTCTTTGT 


2820 


CAGTAATATG 


TGAAGATGCT 


TCTTCAGGGT 


CATG 






2854 



(2) INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS: 

30 

(A) LENGTH: 14 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 



40 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 67: 








CTATGATAGN 


ATCGGAAAGA 


TGTATAAAGT 


TATCTAAAAG 


TTATACGACA 


CAAGTACACG 


60 




AGGCACATCG 


CTATGCGGTG 


TGTCTTNTGG 


TATGCAATCA 


AAGAGGTGTA 


AG AGATG AC C 


120 


45 


AAGCATAATA 


ACATTTATTA 


GCATGGNCGT 


AAGTCATATC 


AATACGATTG 


GTTCTATCAT 


180 


TCAAAAGCAT 


GGAAGAAGTT 


AAGAGAGAT A 


GCATTAGATA 


GGGATAATCA 


TCTTTGTCAA 


240 




ATGTGTTTAC 


GTGAAGACAT 


NGTAACAGAT 


GCAAACATAG 


TGCATCATAT 


TATTTATGTT 


300 




GATGAAGATT 


TTAATAAAGC 


TTTAGACTTA 


GATAATTTGA 


TGTCAGTTTG 


TTATAGCTGT 


360 


50 


CATAACAAAA 


TTCATGCAAA 


TGATAATGAC 


AAAAGTAATC 


TTAAGAAAAT 


TAGAGTATTA 


420 


AAAATTTAAA 


TAAAAAAATA 


ATTTATTTTT 


ATAGCCCCCT 


ACCCATCGGC 


TTAAAATGTT 


480 




TTTTCGACGG 


GTACCGGCGG 


GGGCCCTTCG 


CTTGCAACGC 


GGATAAACTT 


TTATGAAAGG 


540 




GGGTCTTTAT 


ATGAAATTAA 


CAAAAAAACA 


GCTGAAAGAA 


TATATAGAGG 


ATTATAAAAA 


600 


55 


ATCTGATGAC 


ATATTAATTA 


ATTTGTATAT 


AGAAACGTAT 


GAATTTTATT 


GTCGGTTAAG 


660 




AGATGAACTT 


AAAAATAGTG 


ATTTGGATGA 


TAGAGCATAC 


AAACAAGGCT 


GGTGCCGAGC 


720 
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AATATTGTTA 


AGAATCCATT 


AAGCATAGAA 


CTGACAAAAA 


CAGNTCAAAC 


ACTAAATAAC 


780 




TTACTCAAGT 


CTATGGGTTT 


ANCTGCAGCA 


CAAAGAAAAA 


AGATAGCNCA 


AGAAGAAGGT 


840 


5 


GGATTCGGTG 


ACTATTAAAG 


TTTTAAATGA 


ACCTTCACCA 


AAACTATTAA 




900 




TGCAGAGCAA 


GTCACTCAAG 


GGAAAATAAA 


AAC AAG C AAA 


TATGTTAAAA 


AAGAATGTGA 


960 




GAGACACCTT 


AGATATCTAG 


AAAATGGAGG 


TAAATGGGTA 


TTTGATGAAG 


AATTAGCGCA 


1020 




CCGCCCTATT 


CGATTCATAG 


AAAAGTTTTG 


TAAACCNTCC 


AAAGGATCTA 


AACGTCAACT 


1080 


10 


TGCATTACAA 


CCATGGCAAC 


ATTTTATTAT 


TGGCAGTTTG 


TTTGGTTGGG 


TTCATAAAGA 


1140 




AACAAAACTG 


CGCAGGTTTA 


AAGAAGCTTT 


GATATTTATG 


GGGGCGAAAA 


AATGGTAAAA 


1200 




CAACTACTAT 


ATCTGGTGTT 


GCTAACTATG 


CTGTTTCTCA 


AGATGGAGAA 


AACGGCGCTG 


1260 




AAATCCATCT 


NTTAGCAAAC 


GTAATGAAAC 


AAGCTAGGAT 


TCTATTTGAT 


GAATCTAAGG 


1320 


15 


CGATGATTAA 


AGCTAGCCCA 


NAGCTTAGAG 


AAAATTTTAG 


ACCTTTGAGA 


GATGAAATTC 


1380 




ATTACGATGC 


ACTATATCTT 


AAAATATGCA 


CAGGCTTCAG 


ACAGTGATAA 


GTGGTTGGTT 


1440 




AA 












1442 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



35 


GAATTCAAGC 


CGACCGATGA 


NGGTTATNTG 


ANATTACATA 


AGACGTGGTT 


TNGTNANTCA 


60 




AAGCTATGTC 


CAGTTTGTAA 


TCGGAGGCGT 


GCTATGAAAA 


NTAGTTATCA 


CGCTCANANA 


120 




GTGATTGANG 


ANGTAATTAA 


GGAAAAGCCA 


NCAGCACGTT 




ACCACTNTCA 


180 




CCAANANNTG 


CGATAGATGG 


AGATAC TTTA 


GTAACAAAGT 


TNGANGCATC 


TAACTAANGC 


240 


40 


ATTTGATAGG 


TTGAGTAGCA 


TATNAAAAGG 


TTAAACANAA 


TCTNGTTGGA 


TCTATGCGTN 


300 




CAACAGAAGT 


TNCCGCCTAC 


CTAAAAATGA 


CG 






332 



(2) INFORMATION FOR SEQ ID NO: 69: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
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5 



10 



15 



20 



CCATCCGTTC 


NCNTCATCNC 


ANGACNTTCN 


ANANTTAATT 


TTTACNAGGA 


G TGAC ATCTG 


60 


TGAATAACAC 


AC AATCC TC A 


CCACGCAGTN 


ATANTATTAT 


TGCGATTATG 


TTGTC TGC AT 


120 


TAACATATTG 


GGTGTGTGNA 


CAATCATTTA 


TTAATATAGG 


ACCTCTCGNN 


GGNCAAACGT 


180 


ATCAAACCTC 


TCCTGCCGTG 


TTAAATTTAT 


CTATTAGTTT 


AACTTCCCTC 


GCCACAGGTA 


240 


TCTTCATGGT 


GGCTGCAGGT 


GATATTGCTG 


ATAAAATAGG 


ACAANTGAGA 


ATGACATACA 


300 


TGGGTCTCAT 


AATCAGNATG 


TTTGNATCTC 


TTCTATTAAT 


TATATCGGAC 


ATCACTGCAC 


360 


TGCTCATCAT 


CGGNAGGAAT 


TTTACAAGGT 


CTATCAGCAG 


NTATC TTGGT 


TACCCTCCAA 


420 


CAGTTGGCGT 


TTTAAATAAT 


CAATTTAAAG 


GAGAACATTT 


AAGACGAGCG 


ATTAGTTATC 


480 


TAATGATTAG 


TACTGTTGGG 


GGCATCGGCC 


TAGCTGGTGT 


TATCGGCGGT 


TTAATTGCTA 


540 


CAAATTTCGG 


ATGGCAAATG 


AATTTCATCA 


TTAGTATAGT 


CATTGNTTTC 


ATTGCCATAT 


600 


TACTTCTAAA 


AGGCACACCT 


GAAAAAGTAA 


GTCAACATAG 


NCACCGTCAT 


CCGTTCGATT 


660 


ACAAAGGTAT 


GTCGATTTTC 


GCTGTTATGA 


TTGGTAGCTT 


TACATTATTG 


TTAACACAAG 


720 


GATTCGAACA 


AGGTTGGTTT 


AGTACATTCT 


CATTCATTTG 


TC TGAGC ATT 


TTTATCATCA 


780 


CTACGCTGAT 


ATTCATCATC 


ATCGAACGTC 


GACATGAAGT 


ACCCTTTTAT 


TGATTTCTCA 


840 


GTATTACGCA 


ACCGNCCGNT 


CATTGGTGCA 


TTTTTAAATA 


ACTTTGTTTT 


AAATAGCGGT 


900 


CTAGGCGTTA 


CAGTGGTCTT 


TTTCATATAT 


GCTCAAACAC 


ACCTTGGTTT 


ATCAGCTGCG 


960 


CAATCTGGAT 


TGNTACATTG 


GCATATNCCA 


TAGTGGTAGT 


TGCGATGATT 


CGTCTTAGGT 


1020 


GAAAAAAGCA 


ACATTACGGT 


TCGGGTGGGC 


AAATTGGATG 


CTCATCATGG 




1070 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 





GAACAANTGG 


TAAAACCACG 


ACTTCNAACT 


TAATTGGACN 


TACTTTAAAA 


GCAAATAATA 


60 




TNCNAATTAT 


ACNCAATAAT 


GAAGGTGCTA 


ATATGGCTGC 


AGGTATAACT 


TCTGCATNCA 


120 


45 


TCATGCAATC 


AACACCTAAG 


AC T AAAATTG 


CGGTAATCGA 


AATTGATGAA 


GGTTCGANTC 


180 




CACGTGTGTT 


AANAGAAGTT 


ACACCTNCAN 


TGATGGTATT 


TACTAATTTC 


TTNAGAGATC 


240 




AAATGGATCG 


CTTCGGTGAA 


ATTGATATTA 


TGGTTAATNA 


CATNGNAGAG 


ACAATTAGTA 


300 




ATANAGGCAT 


CAANTTATTG 


CTAAATGCTG 


ATGATCCATG 


TGNGAGTCGG 


GTGAAAATCG 


360 


50 


CAAGTGANAC 


GATTGTGTAC 


TATGGTATGA 


AAGCACATGC 


CCATGAATTT 


GNACAAATGT 


420 




ACGATGAATG 


AAAGTAGATA 


TTGNCCAANC 


TGTGGTCGCT 


TATTGCAATA 


CGATTATATT 


480 




CATTATAATC 


AAATNGGTCA 


TTATC AC TGT 


CAGNGTGGTT 


TCANACGAGA 


GCAAGCAAAA 


540 




TATGAAATAT 


CANGTTTTGA 


TGTGGCACCG 


TTTCTATATC 


TANATATCAA 


TGATGAAAAN 


600 


55 


TNTGATATGA 


AAATTGCAGG 


TGACTTGAAC 


GCTTATAACG 


CGTTAGNAGC 


ATATACTGTT 


660 




TNAAGAGAGC 


TAGGGTTAAA 


TGAACAANCA 


AATTAANAAT 


GGC TTTGAAT 


ACGTATACAT 


720 
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CAGACAATGG TCGCTATGCA GTACTTTTAA NANAGAACGA AAAGAAGCGA TGACTCAANT 7 80 

TTAGCTAAAA ATCCTGCAGG AATGAATGCA NAGTCTATCA AGTGGGTGAA CAATTAGAAG 840 

GCGAAAAAGT GTATGTTATT TCGCTAAATG ATAACGC TGC AGATGGTCGA GATACTTCAT 900 

GGATTTATGA TGCAGATTTT GGAAAAATTA TCTAAGCAAC AAATTGAAGC TATCATCGTG 960 

ACAGGTACAC GAGCAGAAGA ACTTCAATTG CGATTGAAGT TAGCAGAGGT TGAAGTACCA 1020 

ATTATTGTTG AGCGTGATAT TTATAAAGCA ACGGCAAAGA CTATGGATTA TAAGGTTTCA 1080 

CAGTTGCAAT ACCAAACTAT ACATCAATTA GCGCCTATGC TTGACAATTA AACCGTCCGT 114 0 

TTGAAGGAGG GCAATCATAA TATGCATGAA TTGACTATTT ATCATTTNAT GTCAGATAAA 12 00 

TTGAATTTAT NCAGTGATAT AGGAAATATT ATTGCTTTAA GACAAC 12 46 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



30 



35 



40 



45 



CATTCAAGAA 


TTATCTATAC 


TTTAAGCCCG ACTAAAATCT ATAGAACAGC 


ATT AC T AAAA 


60 


GATAATGACA 


TTTATTTTCC 


AGAAGAATTA 


AAGAGTGCAG 


AAGATCAATT 


ATTTACAATG 


120 


AAAGCATATT 


TGAATGCAAA 


TCGAATCAGT 


GTGTTAAGTG 


ATAAAGCGTA 


TTATTATGCT 


180 


ACAAAGCGTG 


AAGGTGAACA 


TATGAGTAGT 


GCGTATGTTT 


CACCTGAAGA 


CTTTTACGAA 


240 


GTCATGAGAT 


TGATTGCTGT 


AGAAATATTA 


AATGCAGATT 


TAGAAGAAGC 


TCATAAAGGA 


300 


TCAAATCTTA 


GCAGGAATTT 


TTAAATCCGN 


CATTTTAGTT 


TTTCTCCGTA 


CGAATGGCTT 


360 


CTCACTTAAA 


GTTAAACTTG 


AAGAGCAACC 


AC AATGG ATT 


AATGCTCTAG 


GAGGACTTTA 


420 


TACAAGCAGT 


TCCAGAACGT 


GTAGATGCAT 


TGGTGATGAG 


TAAATTACGA 


CCATTGTTGC 


480 


ACTACGCGAG 


AGCGAAAGAT 


ATAGACAACT 


ATAGAACTGT 


AGAAGAAAGT 


TACCGTCAAG 


540 


GTCAATACTA 


CCGTTNNGNT 


ATTGTAGATG 


GTAAATTAAA 


CATTCAATTC 


AATGAAGGCG 


600 


AACCATACTT 


TGGAAGGCAT 


TGATATCGCT 


AAGCCAAAAG 


TGAAAATGAC 


AGCATTTAAA 


660 


TTTGGATAAT 


CATAAAATTG 


NTACAGGAGC 


TAACGGTTAA 


ATGATTTATG 


ATTGGCGAAG 


720 


GGACAATTAT 


GATGTTCAAA 


GCTTTAAATT 


T 






751 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



25 



35 



40 



50 



(ii) MOLECULE TYPE: Genomic DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCTGTTTGCC CTAATGGANT CGGGTNACAA TNCACGGAGA TCCTTCTTCT CACATCAATN 60 

TNAAATGGGG TGTTC AACC T CATCTTTGAT CTGATGACGA TGACATGCGC GGCATGCAAC 120 

TTGTCCGNTC AATTTGTTCA AGATCATTCG T AC GNAT AAA ATTAGTAATN TTAACGTGTA 180 

TCGGNAATNT TAAATAAATG TTTAATGCTT TTGAGAATAC AGTTCTANCC GNAGACGTCG 240 

GGGACACTAA ATCACCGGAG ACGCCAAGGT CTTTTATTAA ACCTTGTTCA CTATATTGCA 300 

TATACTGTGG AT AC TGTCNC AACACATTGA NTTGATAAGG ATGTGTTGGT AATAAAATAA 3 60 

AATCTTTGGG TATCTCTGAT ATATCTATGT CTGCTAATTG ATACAACACT TTCTCAACCT 42 0 

GATCTTCTTT ACCTTCTACA TAGCGCGTGA GCAGAACATC TGGATGCACA GCTAAATAAT 480 

GCAATTGGAA TGATGTATGA CATTCGGGTG CATATTTCTC TAAATCTGCT TCTGAAAACC 540 

CACTTGCACT CTTAGGAGTC GGGATGAAAT GGATGACCTA AGTATTNAAG ANTGGTCTGA 600 

AACGATATAA CGATCCTCTA CGTAGGC TAT TGTGTTACTT GGGAAAGAAC GCGCCGNGCG 660 

ATGAATGCTA TTATCGATGN CAAAC ATAAT TNGCGCCATA TGTTGGTTGC ACTGCCGTTT 720 

GATTATCTGC ACTTTGAGCC ATATGTGGCA AAATACGCGC AATTGCTTCT TTATAAGTTG 780 

TTATTTTTTT ACTTTTTCCA TCGATAAGCC ATACCTCTGG ATGATACATA TGATGCCCCA 840 

TCGCAGACCA ATAGCGAAAT TCACCCGTTA AAGTTTCGAG CTCTGATAAT TGTATAGACC 900 

ATTGATGATT TTGAGGTGGT ACTTGATATA AATTTTCTTC TCTAAAATAT TCATTTAAAA 960 

TGCGTTCGAT AGCCGCATAC GCTGCATGTT GTATTAATTC TTTATTTTGC ACTTTTTGGT 1020 

TTCAACTCCC ATAATTTCAT TAATGTGTGA TCGGTTGATT TGATTAGTGA TGGTTGAACA 1080 

AATTAAAAAT AAACTACTTA CTGC AAATAC TACGCCCATA ACGATAAACG TAGTAGCTGG 1140 

TGTAGTATAA CTTGTAATGG CAGCGCCACT AAGACTGCCA ATAATTTGAC CAACAACTAA 12 00 

CATACTGTTC GTCGTTCCAA CAAATGTGCC NTTAAGNTGT TGATGACACG CANTC AC G AC 12 60 

AACAAACATG ACACTTTGAA TCNATGCACT AGATGTTAAT CCTTGAAGTA TTCTTGCAGC 1320 

CATTAAANAC TCTATATTCG TCGCTAAACC TTGCAGTATC GCACTACAAC CACATGCAAT 13 80 

CGTGGCAAAT ATAG 13 94 

(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 



ATAATATGTC TAATTGACTA ACTTGTTGGA GTCATTTACT ATTTTATGTA TG AC AT AT TT 60 
55 TAAAAAGTGA GGGTCAAGCA TGTCTTATAA AGCATATCCA CTCTTTAGAG ATATATTAAT 120 

AAATGAATGT ATTTATTTCG CCTCTAAAAA TAAAAAACTA GTACGCCTAA ATTATAAAAG 180 
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TGAAGCGTAT 


GTAGGCGTTT GGACAGAAGA 


AAGTGTGGCC 


GTATCATTTT 


TAACAAGTCG 


240 




TGATATTCCA 


TNTGATAAAG TTGTAAAAAT 


GGACGTTGAT 


CCGCCGNGCT 


ACTTATGAAT 


300 


5 


TAGATGAATT 


GTGTGATGAA CAAGACATAT 


TATTATGAAT 


CAAACAATGG 


AAGAAGAAGG 


360 




GCATCTACTA 


ACGTGGGGCT TGNTACAAAA 


AGAAGGTGAT 


GACGGGATTA 


GATAAAANAG 


420 




ATCAAAGATT 


TGGGCCCAGA TGTTGCAAAT 


ATGATGAAG 






459 


10 


(2) INFORMATION FOR SEQ 


ID NO: 74: 










<i) SEQUENCE CHARACTERISTICS: 










(A) LENGTH: 93 0 base pairs 








15 


(B) TYPE: nucleic acid 










(C) STRANDEDNESS: single 












(D! 


> TOPOLOGY: linear 












(ii) MOLECULE TYPE: Genomic DNA 










<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 74: 






25 


CTGGTGCGTC 


TTC TAATGAA TTTAGAGATT 


TTTTAC C AAA 


TCACCATATG 


CAGTATACGA 


60 




TGATGAAGTA 


TGCACGTGAA CATGGTGCAA 


CAACTTACGA 


TTTCGGTGGT 


ACAGATAATG 


120 




ATCCAGATAA 


AGACTCTGAA CATTATGGCT 


TATGGGCATT 


TAAAAAAGTG 


TGGGGAACAT 


180 




ACTTAAGTGA 


AAAGATTGGT GAATTTGATT 


ATCTATTGAA 


TCAGCCATTG 


TACCAATTAA 


240 


30 


TTGAGCAAGT 


TAAACCGCGT TTAACAAAAG 


CTAAAATTAA 


AATATCTCGT 


AAATTAAAAC 


300 




GAAAATAGAT 


TAACGACTGA AATCTGAACG 


CTCATAAGAC 


TGTCATTTGC 


GTTCAGATTT 


360 




TTTTACACAA 


TATAGAATGG TTGAGTAAAA 


TATTTTTGAA 


TATAGTGAAA 


GAGGGGGAAG 


420 




TACTGTGATA 


AAAAAGCTAT TACAATTTTC 


TTTGGGGAAT 


AAGTTTGCTA 


TCTTTTTAAT 


480 


35 


GGTTGTTTTA 


GTTGGCTTGG GCGGTGTATA 


CGAGCGAAGT 


GCTAAATTGA 


AATTAGAATT 


540 




ACTACCAAAN 


TGTACAAAAA TCCAGTTATT 


TCAAGTTACA 


ACAACAATGC 


CGGGGTGCAA 


600 




CGCCACAAAG 


TACCCAAGAT GAAATAAGTA 


GTAAAATTGA 


CAATCAAGTA 


AGATCGTTGG 


660 




CATATGTGAA 


AAATGTTAAA ACGCAATCCA 


TACAAAATGC 


TTCAATTGTA 


ACAGTTGAAT 


720 


40 


ATGAAAATAA 


TACAGATATG GATAAAGCAG 


AAGAACAGCT 


TAAAAAAGAA 


ATCGATAAAA 


780 




TTAAATTTAA 


AGATGAAGTT GGTCAAC CAG 


AATTAAGACG 


TAATTCGATG 


GACGCTTTTC 


840 




CGGTTTTAGC 


ATATTCATTT CAAATAAAGA 


GATGACTTGA 


AAAAGTACGA 


AAGTACTGAA 


900 




TGACAATTAA 


TACCAAATAC AAACGGAGAT 








930 



45 

(2) INFORMATION FOR SEQ ID NO; 75: 



(i) SEQUENCE CHARACTERISTICS: 
so <A) LENGTH: 744 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE : Genomic DNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

5 



10 



15 



20 



TTTACCTTCC 


TTAATTGCTT 


CCAACCCAAT 


CATCAGCTAA 


TAATCC AC TA 


CTTTTTC ACC 


60 


CATGCTCTTT 


CACNAATGTT 


TTGAACTGCG 


TTTCCTTTAA 


CGTCATACCA 


ACCTGTTTTA 


120 


CCTACTTTTG 


TAATATTACT 


TGCCAACACC 


ATTTTATTAT 


TATAAGCGAT 


TTGGTTAGCT 


180 


ATCGTGAACA 


TAGGTTTTAA 


ACATCGTCTG 


TGCACCCATA 


AAGGTATGCC 


TATCCAGGTT 


240 


TTTTGATTAC 


TATCAGTAGC 


ATCAGATTTC 


C AAAAAC CAT 


ATTGATTGGC 


GTTG TCTGC A 


300 


ACAGACTGCA 


CAGAAGCTTC 


TTTAGAAACT 


AGATATTCCG 


GAACATGATA 


ATTTTT AC G A 


360 


ATGTTATCAA 


TTAAATGACT 


TTC TAAAGTC 


AC AACCGGTT 


CTATTTGAAT 


CGGATCACCT 


420 


ACAGCTACAA 




ACGATATAAT 


GCTCCCACAG 


CTGCTTGAGG 


TATTGCTTGT 


480 


CCTGCTTCAT 


CAATAAATAA 


GTAGTCTATG 


AAATCTTGTG 


GTATGCCCCC 


ATACATAGAT 


540 


TTAAAGCTTG 


CAAACGTCGN 


ACTAACTACT 


GGAAATATNA 


AATGCATCAC 


ATTCCATGCG 


600 


TNGTGTNCTT 


TATCTGGATG 


NGCATCAATT 


AATTTCCTTC 


TATCTTNAAA 


ATCATTAATC 


660 


GCATAATAAA 


TAGTTGGTTT 


TATTAGCAAT 


CAATAATAAT 


TTATGCAATA 


TCAATNGCTT 


720 


CTTAAAAAGA 


GCATGGACCG 


TCTT 








744 



(2) INFORMATION FOR SEQ ID NO: 76: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 
3o (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 





GGCCTGGATA 


AGTAGTACCA 


GTCCACATAA 


TCTAACCTAA 


GACAATTAAG 


ATTAAGATTG 


60 


40 


ATCGCATGAT 


TCAATTTATT 


TAGC TTTGT A 


ACCCAATCAT 


ATTGANTAAA 


TCTTTAGGGT 


120 


GCCAATATGG 


TGGTGCATAA 


NCCACNNCAA 


ACTCAGTTAA 


CTCATCTACA 


GTT AGCTGG T 


180 




TCATCATTGC 


CATCGATAGT 


ACATCAATAC 


GTTTATCTGC 


ACCTTCTTTT 


CCTACTGCAG 


240 




CTGCTCTTAA 


AATCTGACGG 


TTTGAAGTGT 


CATAATATAC 


CCTTAAGTGT 


AAAGGGGAAT 


300 


45 


TTCCTGGGTA 


ATAATTCGCG 


TGTGCACCTT 


GAGTG AC TTC 


CACCATTTTA 


TAGTCAAATT 


360 




GCTTTAGTTC 


ATTTGGTTTA 


ACGCCGACAC 


TCGCAAATGT 


ATAATCAAAG 


AACTTCACAA 


420 




TATTGTTGCC 


TAAGAAGCCT 


TTGAATTCAA 


TAGTGTCATT 


TCCAGCAATT 


TGTTCGGCAA 


480 




CAATACTTGC 


TGCACGGTGA 


GCGCCCCAAG 


CTAAAGGAAC 


ACTAGCCGGT 


AGATCGACAT 


540 


50 


GTCGATAATG 


TGATGTTGCA 


ATATCGCCTA 


TTGCATAAAT 


GTTTGGAACA 


TTTGTTTCAA 


600 




ATTTATCGTT 


TACGGTATGA 


AACCTTTTCG 


ATCAAGTTTG 


ATATTTGAAC 


TTTCGATAAA 


660 




TTTTGAATTG 


GGGTG AG TAC 


CGACACCTTC 


AATAATCATA 


TCGTAATGTT 


CAACTTTTCC 


720 




TGATTTAAAT 


GTAATTTCAT 


TTCCATTGAT 


AGCATCAATT 


CCTCATTTAA 


CGGTATGGAT 


780 


55 


CTCCGCTATC 


TATTCAACAA 


GTATTGGTTG 


CTTCATGTCA 


GCATCCATAT 


TTATTTATCT 


840 




TATCAGATCG 


ATGATTAA 










858 
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(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



20 



25 



30 



35 



GG AATGCGAT 


ATGCCTTTTT 


ATTCATCGTC 


ATATAAATTG 


CATTTTCATT 


TTTACGTGTC 


60 


ATACGCTGAA AAGGATGCTT 


CACAAATTGT 


GGTGTAGTAT 


AACCAATTCC 


AATTTC C AAA 


120 


TACAACACTT 


TATCATCTTG 


ATGTTGCTCT 


AGAAAAGCAT 


TATAACGTTG 


NNGNTGNGCC 


160 


TGAAATTCAG 


CATCTTCAAC 


CATGCCAACT 


TCCGCTTTAC 


GTTTATTCAC 


TTCCATTGGA 


240 


GCATCACATT 


NTGGACATCT 


TGGAATCATC 


TCCCAAGGTA 


TAAGCATATC 


TTGTTGCGCA 


300 


ACAACCATTT 


TACGAATTAA 


ATCATCATTG 


CGATACGTGT 


GAGCATGACA 


ATGCTGACTA 


360 


CACTGNTGNA 


GTATATACTC 


CCCNTGTATA 


TGAAATACAT 


GAGTCATATC 


ATATTCAGCA 


420 


GCATCGAAAG 


CATTGTCTGC 


ATTCGGAGNT 


ATGATATGGT 


ACTGTTTACC 


CTCCACTAAG 


480 


GATTTTAATG 


CGAGATAAGA 


CTGACCTACA 


GGTTGGATCT 


TAATAATTTA 


NTGTAATAAA 


540 


ACGACTC TC A 


AATGCCCAAT 


ACTCTTGCCA' 


ACTGCCATAA 


GGATGTAAAC 


TCGCTTGCAA 


600 


CATATCAAAG 


AAGCGATATT 


T TTC AAT AAA 


ATCTGGGAAA 


TTTTCCGTAA 


AACGCTCTCC 


660 


TACATATGTA 


AATCCGTCAG 


ATGCAGACAT 


GCCTGCACCA 


ATTCCAATCA 


CTATCGCATC 


720 


TGCTTCATCA 


ATCGCAGTAC 


GCAATACTTC 


AGCCTGCTTT 


GTCTTTTCAT 


CCATTAAAAG 


780 


AGACATTGCA 


TTCCATTTAC 


TACTCTGCAT 


CACGGCTCAA 


TGCCTCCTTA 


TACAGTTGGT 


840 


AAATCC TTAT 


CTTGGAAATA 


CATTGAACAC 


GACTTTCAAT 


GTTGAATTTG 


GCTCTGCGAG 


900 


ATAGCTTTCT 


TCTGTTCGAC 


AGCATTTCTG 


CTGCTTCATC 


TTGAGAAAAG 


CAATACACTG 


960 


TAGATATACA GCAAAGCGAC 


ATGATTTAAC 


TATGTGGGCC 


AGGC 




1004 



40 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

4S (A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

55 

ATATTAGCAG CTGTGCTTGC TTGTATTTTA GCTGGTGTAC CCTATCAAAG CAATTATTGA 60 
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TAGTTTAACT 


ACATTTTCAG 


GAATAGAGCA 


TAGATTGCAA 


TATGTTGGTA 


CTAATAGAAC 


120 




TAATAAATAT 


TATAATGATT 


CCNAAGCAAC 


AAACACGCTA 


GCAACACAGT 


TTGCCTTAAA 


180 


5 


TTCATTTAAT 


CAACCAATCA 


TTTGGTTATT 


GTGG TGGTAT 


TGGATCGAGG 


GAGATGAATT 


240 




TGACGAACTC 


ATTCCTTATA 


TGGAAAATGT 


TNCGCACGAT 


GGTTGTATTC 


GGACAAACGA 


300 




AAGCTNAGTT 


TGCTAAACTA 


GGTAATAGTC 


AAGGGAAATC 


GG i CANT. GAA 


GC G AAC AATG 


i a f\ 




TCGAAGACGC 


TGTTGATAAA 


GTACAAGATA 


TTATAGANCC 


AAATGATGT I 


GTATTATTGT 




10 


CACCTGCTTG 


TGCGAGTTGG 


GATCAATATA 


GTACTTTTGA 


AGAGCGTGGA 


GAGAAATTTA 


a q n 




TTGAAAGATT 


CCGTNCCCAT 


TTACCATCTT 


ATTAAAGGGT 


v_> J. 1 GAG iAllo 


A I\jbA J/G A 1 G 






AAACGAAGAN 


CGATCAACAA 


GAATC AAATG 


AAGATAAAGA 


TGAATTAGAA 


TTATTTACGA 


600 




GGAATACATC 


TAAGAAAAGN 


CGGCAAAGGA 


GAAAGGTCAA 


AGGCTACACA 


TTTTTCTACN 


660 


15 


TCAAAATAAA 


GATGATACAT 


CTCAACAAGC 


TGATTTTGAT 


GAAGAAATTT 


ACTTGAG 


717 




(2) INFORMATION FOR SEQ 


ID NO: 79: 








20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH : 650 base pairs 










(B) 


TYPE: nucleic acid 












(C) 


STRANDEDNESS : single 










25 


(D) 


TOPOLOGY: 


linear 












(ii) 


MOLECULE TYPE: Genomic DNA 








30 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 79: 








CTCTTCTTCA 


TCAACATCGA 


TCATACTTAC 


TAAATTGTGG 


ATGTGAAAGC 


TGTGATGAGT 


OU 




ATATGTACTT 


CTCGNNCAAA 


ACGGTTTAAT 


GTNTCCTCCT 


1TTCTCCAGG 


GTGGTATAAA 


120 


35 


AATCGCCTTA 


ATTGCAACCT 


TAATGTTAAG 


TATCGTATCT 


TCTGCAAGAT 


AAACGGTACT 


180 




CATGCCACCG 


NCGCCAAGCT 


TATCTACAAT 


TTTATATCGC 


ACATTTATTA 


TTTTACCTAT 


240 




CATACTTTAT 


CACCTNCAAT 


AGCCGCGAGT 


ATGAAAGTAA 


CGTTATCTTT 


CGAATGGTTA 


300 




TCTAATGCCA 


ATTGCATTAA 


TTGATCACCA 


TGATCTTC T A 


TTGTACCTTC 


TTTTACTAAC 


360 


40 


AAACGCTTAA 


TTTCATTGTC 


TTTAACATAA 


TCAGTTAATC 


CATCTGAATT 


TAATAATAAA 


420 




TAATCATAAA 


AATTTAATCG 


CTTAATAAAC 


AAATCTGGAC 


TCACACGTTT 


ATCTGTGCCC 


480 




ATCACCTTCG 


TAATAATATT 


ACGTTGTGGA 


TGTGTAAATG 


CTTCTTCCGG 


CGTAATTTGA 


540 




CCCGTTAAAA 


CAAGATGATT 


ACAAATGAGT 


GATC ACT AG T 


AATTTGCACA 


ATTGTCTACT 


600 


45 


ATTAATACAT 


AGGCTCTAGA 


ATCACCGACA 


TTTGGTATCA 


CAACTGATTT 




650 



(2) INFORMATION FOR SEQ ID NO: 80: 

50 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 





CACCAAGAAC 


TTACTTATAT 


TTTAGACAAC 


ATAAAAGGGA 


ATAATAATTA 


TGGTAAGGAA 


60 




TTTGTTGCAA 


CCGNTGAAGA 


AACATTCGAC 


ATTGAATAAA 


GCGGGGTGAA 


GCACTATGAA 


120 


10 


TCAATGGGAT 


CAGNTCTTAA 


CACCTTATAA 


GCAAGCGGTT 


GATGAGTTGA 


AAGTGAAACT 


180 




TAAAGGCATG 


CGCAAACAAT 


ATGAAGTTGG 


TGAACAAGCG 


TCGCCAATAG 


AATTTGTTAC 


240 




TGGTCGTGTT 


AACCCGCATC 


GCTAGTATTA 


TAGATAAGGC 


AAACAAACGA 


CAAATACCAT 


300 




TTGGATAGGT 


TAAGAGAAGA 


AATGTACGAT 


ATCGCTGGGT 


TTAAGAATGA 


TGTGCCAATT 


360 


15 


GGTAGAAGAT 


ATTGATGTTG 


GCCGCCAATA 


TTTTTAAGGA 


CAAAGAAAAG 


ATTTTAAAGN 


420 




TATTGGAGAC 


CGAGATTATT 


TCCCGNACAC 


TAAAGGAAGG 


TGGGTACCGT 


CCGCTTCANG 


v 480 




TCAATATTGG 


ATTTCCCAAC 


TGGACCAATA 


CAAGGCCAAA 


ATTT 




524 



20 (2) INFORMATION FOR SEQ ID NO: 81: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



35 


CGTCTATCAT 


TGTTAACATA 


TATAGCACCT 


TCCTTATTTT 


AATGTTGTTT 


TAGTTGAATG 


60 




ACAGTAAAAA 


GGTTGTTAAG 


ATACTCATAC 


ATTTTTATGT 


GTAAATATCT 


ACAAAGTTAC 


120 




CCANCTACTG 


ACAATGTTTA 


TTTNAGATAG 


TATATGTAAA 


TTCACAGATA 


TGCTAATTGC 


180 




TTAAAAAATG 


ATTAAAGTGT 


TGGCTCCAAG 


CAATGATACT 


TTAGAAATTT 


ATTTATCATC 


240 


40 


TNGACTTTAA 


AAATTATATT 


ATAAATGACG 


TAACTGACAA 


CAGATATACT 


TAGTAATGAA 


300 




GATGTGTAAT 


GTAATTGTTT 


AAAATTGATC 


TCCAAGCAGA 


TTTTATTTAT 


CATTTAATTT 


360 




AAATAGCAAG 


TGGAGGTACA 


AGTAATGAAA 


TTTGGAAAAA 


CAATCGCAGT 


AGTATTAGCA 


420 




TCTAGTGTCT 


TGCTTGNAGG 


ATGTACTACG 


GATAAAAAAG 


AAATTAAGGC 


ATATTTAAAG 


480 


45 


CAAGTGGATA 


AAATTAAAGA 


CGATGAAGAA 


CCAATTAANA 


CTGTTGGNAA 


GAAAATTGCT 


540 




GAATTAGATG 


AGAAAAAGAA 


AAAATTAACT 


GAAGATGTCA 


ATAGTAAAGA 


TACAGCAGGT 


600 




TCGCGGTNAA 


AGCAGTAAAA 


GATTTAATTA 


AAAATGNCGA 


TGATCCGTCT 


TAAAGGAATT 


660 




TGAAAAAGAA 


GAAGACGCAA 


TTAAGAANGT 


C TTG AAC AAA 


GACTTTAAGA 


AAGCAAAANG 


720 


50 


TACGTNGGGA 


TAACATGATA 


TGATGTTAAA 


C 






751 



(2) INFORMATION FOR SEQ ID NO: 82: 

55 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 785 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 





CCTTTGTTTA 


NCATCCTTAG 


TCAGTGTCAT 


AAGGTTGTCC 


CATCAGAAGG 


TAGNCCCCAC 


60 




AATAGTTGAA 


ATCCTCCCAC 


AATAGGGGTA 


GNC TCCTGC A 


TCTGTAAATA 


CCTAC TGGNN 


120 


15 


TGCCANATAC 


TTTTTCTGCT 


GGTGTGTCAA 


AGAAATTTGT 


TGTTCCATTT 


GAGAGAACAC 


180 




TAATAATTTT 


GACATAATCT 


GCATATCGCA 


TATAAATTGT 


TGCGTTATCA 


CGATAATCTT 


240 




CATGTAAATC 


TGCTAAAGCG 


TTAATAATAG 


CATCATACAT GTCTGCTCCC 


NCAACTTCTT 


300 




TAACAGATCC 


ATTATAAAAT 


GACATGTGTT 


CTAATCCAGA 


TTTAGGACTT 


AC TGCTAAGG 


360 


20 


CATCTTTACG 


CTCTTTAGCT 


GCTAATCCTG 


ATTGTAGTGC 


GTTTTCAACC 


CAGTTTACTA 


420 




AATCTACATC 


TGATCCATGA 


ATTACAGTAT 


CTGAAATTGC 


AGCAAATACT 


TTGAANTTAT 


480 




TAAGTAGTGA 


ACTTGACTGT 


ATCACCTTGT 


GCTTTTAATT 


CTTGTGCTGT 


NTCTACGTCT 


540 




GTAATGAAAT 


CATCATCGTC 


TAAAGTGTAT 


GAAAC TCTTG 


GAATCTCTAA 


ACC TTTAATG 


600 


25 


TTAGNTAGAC 


GAGCTTTTAC 


ACGTAATTGG 


GTTTNNAGCA 


AATGGCTCTG 


AAACAATTTC 


660 




TTGTAGAAAG 


TGGGTTGGGG 


AAGAGCTTAT 


CTCCACCTGA 


ATCATTTCCT 


GTTGGTAAAG 


720 




CGTGTAATAA 


ACGTTGTGCC 


TCCATTGAAG 


GTTTTTCAAA 


TTCATTTGGT 


AAAATCGCTC 


780 




GTGCC 












785 



30 

(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



45 





CTTNATAAAN 


ANCNATCNTA 


TGCCAATTTG 


ACCATTGTTT 


GAATCCATAT 


ANAAACCGGC 


60 




NACGGTTCTT 


TTCAAATATA 


ATAGTAAGTG 


TATAATGAAA 


ATGTAAATAT 


TATTAAANAT 


120 




GGGGGTTCAC 


TCAATGAAAA 


TGAAACGTTT 


TATAGCTATT 


GTAATGGCAT 


TATTTTTAGT 


180 


50 


ATTAGNTGGT 


TGCTCTAATT 


CTAACGATAA 


TAATGAAAGT 


AAAAAAGATG 


ACGCAGACAA 


240 




TGGTAAGAAA 


CAAGAGATTC 


AAGTTGCAGC 


GGCAGCAAGT 


TTAACAGATG 


TAACCAAGAA 


300 




ACGAGCTTCA 


GAATTTAAAA 


AAGAGCATAA 


AAATGCTGAT 


ATTAAATTTA 


ACTATGGTGG 


360 




ATCAGGGGCA 


TNAAGAAAAC 


AAATTGANTC 


AGGCGCACCN 


CTTGTTGACG 


TATTNATGNC 


420 


55 


TNCCNAANTN 


CTAAAGATGT 


AGATGCATTN 


NNAAGACAAG 


GAATNNAGCG 


CATTTGATAT 


480 




CATATNAATA 


TGCGNNNNAT 


AGTCTAGTAT 


TAATTGGTGA 


TAAAAGATTC 


AAATTACACT 


540 
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TCAGTAAAAA GACTTAAAAG NCAATGATAA ATTAGCATTA GGTGAAGTGA AAACTGTACC 600 

AGCAGGAAAA TATGCGAAAC AGTATTTAGA TAACAATAAC TTATTTAAAG AAGTCGAAAG 660 

5 TAAAATCGTT TATGCTAAAG ATGTAAAACA AGTATTAAAT TATGTTTGAA AAGGGTTAAT 720 

GCGAAACAAG GTTTTGTGTA TAAAACTGAC TTATATAANC AANNCNNAAA AATTGATACT 780 

GTAAAAGTAA TTAAAGAAGT AGAACTTAAG AAACCAATCA CATACGAAGC TGGTGCTACA 840 

TCAGATAGTA AATTANCAAA AGAGTGGATG GATTCTTAAA TCAGATAAGC TAAGAATATT 90 0 

10 AAAGATACAC TTTGCAGCAT AAGA 924 

(2) INFORMATION FOR SEQ ID NO: 84: 

T5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



30 



35 



GGCACGAGCT 


CGTGCCGAAC 


TTACGGCACG 


ACGCGATACN 


ATAAATACAT 


TTGTTACAGT 


60 


TACGCTACTA 


ATCGTTTGTG 


CTACATCTTT 


AAAATCAACC 


GCAATACCTT 


TGTGTTTGTG 


120 


AAGTAACGTG 


TTTAACTCTC 


TCGTTTTAGA 


TAATAAGCTC 


ATGAATTTCT 


CTCCTTGTGT 


180 


ATATTTTTAT 


AG AAT AAA TG 


CACTTAAATC 


TTTATNTGTT 


GAAATTGATT 


TTAATTTATC 


240 


ATCAACATAT 


TGTGGGGTAA 


TATCTACAAC 


TGCATTCGGC 


ATACTTGGTG 


CTTCGAATGA 


300 


TAAATCTTCT 


AGCATCTTNT 


CTAAAATTGT 


ATGAAGTCGA 


CGGTGCAACA 


ATGTNGTCTT 


360 


GTATCTTGGA 


TTCACTTGGA 


TAAGCAATCT 


CAAGCTTAAG 


GCGANTAATT 


GCTTCCATCG 


420 


GGTAAAGTTT 


AC AGGNAAC T 


TCTTCTTNTT 


TGGGAGCAAT 


GCTTCATATT 


TGGNNTAATT 


480 


AATGACAATG 


TGGGGTCTGG 


NCAAAATTCT 


TACAAAATCT 


TCCTNCCCGT 


TANCTA 


536 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

CTACCTTCAT GTTTTGTAAA GTTGATTAAA GCATTCATTG CTTGTGTATC ACTTGCATTT 60 
TCAAAAGTTA CTCTACCAAT GTCGTGGTAA TAAGAATGTT CAGGCCCAAT ACCAGGATAA 120 
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1 <jAAAI AljA 


ATvaTGC I Alj 1 




1 1 1 L. 




i on 




TACATTTTAG 


TACCATGTAA 


TACGCCAGGT 


GATCCTTTGN 


CAATTGCAAG 


TGCATGTTTA 


240 


5 


TCAGTATCCT 


CGCCTTGACC 


TGCGGNTTCA 


ACACCGNATT 


AATGCAACAT 


CATCTTTAAT 


300 




AAATGGATAA 


AAGGCACGAG 


ACCGATTGCA 


TTTGAGCCAC 


CACCGATACA 


TGCTACAATT 


360 




GCATCCGGAA 


GTCGACCTTC 


TTTCTTCAAT 


ATCTGTGATT 


TNATTTCTTT 


ACCAATGACA 


420 




CTCTGAAAAT 


CTCTAACAAT 


CGTTGGGAAT 


GGGTCTGGAC 


CTAATGC AG A 


ACCTAATAAA 


480 


10 


TAATGTGTAT 


CATCTACATG 


ACTTACCCAA 


TATTGCAATG 


CTTTATTAAC 


TGCATCCGAT 


540 




AAAGTCCCTT 


GANCTTCTAC 


AACTGCCACA 


ACCCTTNGCA 


CCAAGTAATT 


CCATTCTAAA 


600 




TACATTAAGT 


TGGTG TCTTT 


TAATATCTTC 


ACTTCCCATA 


AAGACAACAA 


GTTCCATATC 


660 




AAATAATGGA 


GGAACCGGAG 


GACT AGC T AC 


AACATGTTGA 


CCCGCACCAG 


TTTCAGNAAC 


720 


15 


AAGCTTCTTC 


TTGGCCATTC 


TTTTTGCAAG 


CAACGGTGAC 


TAACGCATAT 


AATTTATGGC 


780 




GCCGTATGAT 


TAGATCCTCT 


CG 








802 



(2) INFORMATION FOR SEQ ID NO: 86: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 
2S (C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 





CTATTCTTCT 


ATGTCTTTGA 


CAAGCGCGAA 


TATTTTTCGT 


CGTTGCTTGC 


CGGTAATTTG 


60 


35 


AAACGGATCT 


ATGACGCTTA 


CATCGACTTC 


CACATCAAAT 


CCGTTATCAA 


GTAGTAATGT 


120 


TTCTTTATTG 


CCTAATTCAA 


CACCCGAGAT 


GACAACTGTT 


GNTGTACCGN 


CATCTTGAGT 


180 




GATATAACTA 


GTAATTATTG 


GCATCTAATC 


ATTCCAATCA 


GAACGGGAGG 


TCTGAAAAAT 


240 




CTTCTTCACT 


ATTGTCAAAC 


GGATTATTGC 


CAGTTTGAGC 


TTGTCCTTGT 


TGTTGATAAT 


300 


40 


TGTTGTTTTG 


NTGTTGGTTG 


TTATTCTTCG 


GTTCTAAGAA 


TTGAACGCTG 


TCCGCTACTA 


360 


CTTCTGTCAC 


AAATACACGT 


CGCCCTTCTT 


TGTTATCGTA ACTGCGTGAT 


TGTAAACGTC 


420 




CATCAACGCC 


AGCCAATGAC 


CCTTTGGATA 


AATAATTATT 


TACATTTTCT 


GCTTGTTTTC 


480 




TAAAAGTTAC 


ACAGTTAATA 


AAGTCTGCCT 


CACGTTCTCC 


TTGAGCGTTA 


GTAAATGTTC 


S40 


45 


TGTTAACTGC 


GATAGTGAAA 


GTGGTNACAC 


TCACACCATN 


TGGGCGCTGT 


TCTATATTCT 


600 




GGATCTTTTT 


GGGTAAGCGT 


CCCACTTAAT 


AC TGTTCTCG 


TNNTAACATT 


ATTTGNTTTC 


660 




CC 












662 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87; 



10 


CTGGTAAAAC 


AACATTAACA 


GCAGCAATCG 


C T ACTGT ATT 


AGCAAAAAAT 


GGTGACTCAG 


60 




TTGCACAATC 


ATATGACATG 


ATTGACAACG 


CTCCAGAAGA 


AAAAGAACGT 


GGTATCACAA 


120 




TCAATACTTC 


TCACATTGAG 


TACCANACTG 


ACAAACGTCA 


CTANGCTC AC 


GTCGNCTGCC 


ISO 




CAGNATCCCG 


CTGACTACGT 


TAAAAACATG 


ATCACTGGTG 


CTGCTCAAAT 


GGACGGCGGT 


240 


15 


ATCTTAGTAG 


TATCTGC TGC 


TGACGGTCCA 


ATGCCACAAA 


CTCGTGAACA 


CATTCTTTTA 


300 




TCACGTAACG 


TTGGTGTACC 


AGCATTAGTA 


GTATTCTTAA 


ACAAAGTTGA 


CATGGTTGAC 


360 




GATGAAGAAT 


TATTAGAATT 


AGTAGAAATG 


GAAGTTC GTG 


ACTTATTAAG 


CGAATATGAC 


420 




TTCCCAGGTG 


ACGATGTACC 


TGTAATCGCT 


GGTTCAGCAT 


TAAAAGCTTT 


AGAAGGCGAT 


480 


20 


GCTCAATACG 


AAGAAAAAAT 


CTTAGAATTA 


ATGGAAGCTG 


TAGATACTTA 


CATTCCACTC 


540 




CAGACGTGAT 


CCTGACAAAC 


CATCAAGATG 


CCAGTGAGAC 


GTATCTCAAC 


ACNGTCGGGG 


600 




TCTGTTGGTA 


CAGGCCCGTT 


GACGTGGGCA 


ATCAAGTGGT 


GAGGAGGTGG 


GATCAC 


656 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 390 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



40 


GAATGCCCAC 


ANAATGATGC 


TGGCAATGAT 


TTTCACTTAG 


TTATGAGCGG 


TAATGAATTT 


60 




TGCGGTAATG 


CGACGATGTC 


ATATATACAT 


CATTTGCAGG 


AAAGTCATTT 


GCTTAAAGAC 


120 




CAACAGTTTA 


AGGTGAAGGT 


ATCTGGCTGT 


TCGGATTTAG 


TGCAATGCGC 


AATTCATGAT 


180 




TGCCAATACT 


ATGAAGTTCA 


AATGCCACAA 


GCC CATC GTG 


TTGTGCCAAC 


AACAATTAAT 


240 


45 


ATGGGTAATC 


ATTCATGGAA 


AGCATTAGAA 


ATTATTTATG 


AAACATATTG 


TACATTATGT 


300 




GATTCCNAGC 


TAAACAAAGT 


NACAACTTGA 


AATTCAACAT 


TTGGNTGGAA 


GCATTTGTGC 


360 




CGTTGANCAA 


CAAATGGAGT 


CACAAATATT 








390 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



10 


GGNACGAGCG 


CATNATTAAA 


AACATATCAG 


GTTATGTATG 


CGCATAAAAA 


AC AGC CTTGA 


60 




AAAACCTTTA 


TAAATCGGAT 


TTTCAAAGCG 


TGTTTTGAAA 


GGTATTTAAT 


TAAAACTAAG 


120 




CATGTTGATG 


TAAATCATCA 


AAGTTTGTTA 


AAC GTTGTTG 


CCACTCATTA 


TCACTAATAT 


180 




TATTTGC TTG 


AACATAACGA 


TTACGCTCAT 


GTTTAGCACA 


TTCATAAGAG 


CATGCACCTA 


240 


15 


AATATTTAGT 


TTCGNTTTCT 


TCAGAAACTA 


ATATTTGTTT 


ATTACATTCT 


GGGTTAGCGC 


300 




AATTAATATT 


AACGNTCACA 


TGGGTTTGGC 


ATCAAAACCC 


AATCCTTACC 


AATAATTGTT 


360 




TTTTCAACTT 


GGGTTGATAT 


CAACACTGAT 


ACGGATCAAC 


AANTTACAAT 


ACAATTTNAC 


420 




CCG 












423 



20 

(2) INFORMATION FOR SEQ ID NO: 90: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



40 



45 



50 



TTGGTGC AC C 


ACCTCAAAAC 


TTTGGTACCA 


TTGATGCATT 


GTACATTAAT 


TGCGCCAAGT 


60 


TTTTGTTAAT 


GCCACCAATG 


ATTCAGGTAA 


TAATTGTCCC 


AAAAGCAGCA 


GTTACAATTA 


120 


AATCTACATC 


TAATTGAANC 


AATTGTTCTA 


ATTCTTCTGA 


TCCACTTAAT 


TTTTCAGGCT 


180 


GATATACAGG 


TAAATCATAT 


TTCATTNCAA 


CTTTTTTAAC 


TGGTGGTGGT 


GTCATAACAC 


240 


GTTTACGTCC 


AACAGGTCGA 


TCTGGTTGCG 


TTACGACTGC 


AATGACATCA 


TGTTCTGCAA 


300 


TAAGCATTTC 


TAAAACAGTT 


GTTGAAAAGG 


CACGAGTACC 


CA^TAAATATT 


ATTTTAGTCA 


360 


TTTATAAAAT 


ATGCCTCCAC 


TTCTTTATCT 


GTTAAAATAC 


GGTCCGCACG 


TTCNGTAAAA 


420 


GGGAGACCGT 


TCATTTGATC 


TATAATATGC 


AAAATCATTC 


TTGCTACATC 


TTCATGTGCA 


480 


GTTAGTTCAA 


CTTTGTTCCC 


ATTGACGTCA 


TAACTTTCGA 


CAACTATCAT 


TTTACTTCTT 


540 


GTCACTTCGC 


CGTAAACATC 


TGGCAATGTA 


ATTGAACCTT 


CTAAGTCTGT 


TATTGTTTCA 


600 


TTTGATTGAC 


TAATAATTTT 


CGGATTAACA 


AGTTGGTAAT 


AATCCTTCCA 


TTTC CAT ATT 


660 


CAATAATTGG 


CACTTGGCAA 


TGACTTGATT 


AATTTGAGGG 


TGCACATAAG 


CCAGCAGCTT 


720 


CTTGGTGCAT 


ACATTGGTAT 


CTTCTTAAAT 


CTTGGTAATA 


ATCTTTTTTA 


CGAATCACAA 


780 


TTGTTTTAAC 


TTGCTTGCGC 


TTTTTCCGGT 


TAAATAGGAT 


GCGATGCTGG 


GTACTAACTT 


840 


TTTTAATCGC 


CAATACTCTT 


ACTCCTCAAT 


AAATCAATCA 


ACTATATACC 


G 


891 



55 

(2) INFORMATION FOR SEQ ID NO: 91: 
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(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH; 652 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

15 



20 



25 



GGCACGAGCT 


ATAAATCCAA 


TGATGAATTG 


TAAAAGTGAA 


TAATTGAGAA 


AAAGGTTAAT 


60 


ATCAAATTTT 


GGTGTCATCA 


TTAATGTAAG 


TTCCTTGGCT 


AACGTTGAGA 


AAGTTGTTAA 


120 


GCCACCTAAA 


AAACCGGTGA 


CAAAGAACGC 


AGGGAACCAT 


GAGATTGAAA 


TTGATAGGCC 


180 


TATAGTTAAT 


CCAATTAAAA 


AACTACCAAC 


TAGATTTACT 


ATCAATGTTG 


CGATAGGTAA 


240 


CTTTGAAGTA 


AATTTATGAT 


TAAAATAATC 


AGTAATGGCA 


CTTCTAGCAA 


TTGCGCCAAA 


300 


ACCGCCGCCA 


ATCATGACTA 


AAATGATTGA 


TATCATGATA 


AACCACCACC 


TAGTTTTATA 


360 


CCGACGTAAC 


ATAACAAAAT 


CCCAAAGACA 


TAACTTGTTA 


CAGCATATAG 


TAGTAAAGTT 


420 


ATAAATTGTT 


G ATG ATC AAA 


CATATGTATT 


AATTCTTAAT 


TGAAATGTTG 


AAAAAGTCGG 


480 


CTAAAGCACA 


AGAAAACAGT 


CGTAATAGCT 


TTTTTTAGGG 


TCGGATGGTT 


TGAAAAAATG 


540 


CAATCGTTAA 


GC TGTTAGC A 


TCCCATTACA 


AAGGCACCAG 


TCAATGGTAT 


CAGTGTCCGA 


600 


TGGAAC TCCG 


CAGTATCAGA 


AAGATGAGGT 


ACGTATAAGG 


CTAAGCACAC 


CG 


652 



30 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

45 



GAAACC ATC C 


CCNCGTGCTC 


GTCACTGATT 


GTTCCGCCAC 


CACCTTGTGT 


TCCACCACTT 


60 


GATTTGATCA 


GCAGGATTGA 


AAGTACCTCT 


NCCACAGTAA 


TACCCATTAT 


TAGTCCCACC 


120 


AAAACCTGTA 


CATTACCTGC 


TGATGCTCCT 


TTCACCCATG 


GGCTCGTATT 


ATNACGACGT 


180 


GTAAATGTCG 


TNACCACATT 


TCCATTACGT 


TTAATAACTA 


ATTTGTC AGC 


ATATGTCGTA 


240 


ACATTACCAG 


CATGAGTATT 


GACTGTTTGG 


TTCGCACCAG 


GTGCAATTGT 


AATCGCTCCT 


300 


GCCGC TGTTT 


CAGTGACAGT 


TGGTTTCGCT 


GGTTGAACAT 


CTTTTACTAC 


AAATTTCGCT 


360 


GGTAAAGATG 


TTGCAAATGT 


ATGTCCATTA 


TAGATGACAT 


CATATTTTGC 


ATTAACGACT 


420 


TGTGCAGTAT 


TTGGTTTATT 


CATTGCTGCC 


CAGTTTGCAT 


CGTTTGTACC 


CGTAGTATCA 


480 


CGATTCCATT 


TATACGTAAA 


TCCATCTGTT 


GGTAAACCTG 


AAGCGTTTTG 


CATATGTGCA 


540 
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TATCCTGATG 


CTTGCGTACC 


ACT TGC TAAA 


GTGCCACCAA 


CTGTTCSTTVST 

v.- & x x vj x X X 


X c\jn\j x x 1 


c r\ n 

D U U 




TGAGGGAATT 


CCAAATTGAT 


ATACGTTCAC 


AGTTACAGGA 


AC TCGTTTAG 


CAGCTGAAAT 


660 


5 


ACCTGGATAT 


GTGACATCGA 


CATTTAAATG 


TTGAACGCCT 


GCTTGCTGGT 


TATTTGGTTG 


720 




TTGTCTATTT 


GCCCATGCTG 


CTGTAATACC 


ATTCGTATTA 


GTATTTGGAT 


CAAATGTAAT 


780 




GTAATCAATA 


GCGTTTGTAC 


CATGTGTCAA 


ATTTTGACCT 


TTCACATCAC 


GTGATGGCGC 


840 




CTTAGCATTA 


GCAACTGGAT 


AAAC TTTG AC 


TGGAACTTCA 


ACATTACGCG 


TACCTTGACC 


900 


10 


ACTAGGTAAT 


GTTACAACCG 


CAGTTTTATG 


TGTGTTGGCG 


AC TGT ATTCT 


TCCATGTATC 


960 




TGGACTATCA 


TGCCATGCGA 


CCGCTGGCCC 


ATGTGGNGGA 


TTTTGGAATA 


AATCGTCTTC 


1020 




ATG AC CGAAA 


TCAAAACCGN 


CGCACCTTTA 


ATAAATACAG 


CGCTTCAGTA 


GTTGCTTGTA 


1080 




TTGTGGGGTT 


ACTGTACAAG 


AAGCACTATC 


ATTTGATCAC 


AGATCAATCT 


TGGTNNTNGA 


1140 


15 


AC 












1142 



(2) INFORMATION FOR SEQ ID NO: 93: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 





GGNANTNGAG 


GAANCAAANC 


NGCCCTTCNA 


CNCGGATTAA 


AACCTGTTNG 


AANCTGNNGA 


60 




AGNGTGGGNC 


NAANNCCTTG 


ANTGCAAGGT 


GCGANGGNCG 


NNTGCAAGNT 


GTNNAACNGC 


120 


35 


NNGNGANCGN 


TTGNCNGTGC 


ANTGNNCAAG 


ANTGGTGAAA 


ACCCNTGTGN 


TANATTGTGC 


180 




GNCCNNCTTG 


GTGANGNTGN 


GTTGNGCNGN 


NTCTTCAGNA 


GTCGCANCTG 


CAGNGTGTCC 


240 




NAT AAGCGC T 


NTNTGNACGG 


TTGCTGGTGT 


TGCNNANTCA 


TCTATCGCAA 


CATCGNTAAT 


300 




TGTTGTATCT 


CCAGTAATAC 


CTTGAATATC 


AGCAACTGCT 


TGATCATTAA 


TTTGCGTAAC 


360 


40 


ATCATTAGTT 


GNTTGTGCAN 


TTAAGATATC 


TTGANACGCT 


TTTTCTTTAG 


CTTNTAAAAC 


420 




TAAATCTTTT 


GCTGCATTTT 


TCTCTTCAGT 


TGTAGCGCCA 


GTTGTATTAT 


CAATTGCTTG 


480 




ATTTTGAGTT 


GTCACAGCTT 


GATCAACTTC 


ATTTTTCGCA 


TTCGATTTAA 


CTGCTGNTGC 


540 




TGGTTGTGTG 


CTTTGAATTA 


NAGAAGACTC 


AAGCTTGCAT 


GCCTGCAGTC 


GACTCTAGAG 


600 


45 


GATCG 












605 



(2) INFORMATION FOR SEQ ID NO: 94: 



50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 



5 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


94: 








AAATTCATAA 


TAAAGNCTTT 


TGNCATTAAA 


GTTNNAAAGA 


AGATAAAGAA 


NCTGCTNNAA 


60 




ATTGACGTAT 


CNNAAAGAAG 


TAAAAGAAGT 


TTAAATANTC 


AAATAATAAA 


TCNACAACAC 


120 


10 


AAGANATTTC 


TGAAGAACAA 


AAAGGTGAAT 


ATCAAAGAAA 


GTCAGAGGCA 


TTAAAAGAAA 


180 




GATTTATAAA 


CAGACAAAAA 


TCTAAAAATG 


AGTCTGTGGT 


TTCACTAATC 


GATGACGAAG 


240 




ACG AC AACGN 


AAACGACAGG 


CAACTTGTGG 


TTTCTGCGCC 


ATCAAAGAAA 


CCAACAACAC 


300 




CGACTACATA 


TACTGAAACA 


ACGACTCAGG 


TANCAATGCC 


TACAGTTGAG 


CGTCAAACTC 


360 


15 


AGCAACAAAT 


CGTTTACAAA 


AC AC CC AAAA 


CCATTAGCTG 


GATTAAATGG 


TGAAAGTCAT 


420 




GATTTCACAA 


CAACGCATCA 


ATCACCAACA 


ACTTCAAATC 


ATACGCATAA 


TAATGTTGTT 


480 




GAATTTGAAG 


AAACGTCTGC 


TTTACCTGGT 


AGAAAATCAG 


GATCACTGGT 


TGGTATAAGT 


540 




CAAATTGATT 


U i^lt.TV_ATCT 


AAC TGAACGT 


GAGAAGCGTG 


TAATCAAGCG 


TGNACACGTT 


600 


20 


AAGAGAAGCT 


CCAAAAGTTA 


GTTGATAATT 


TATAAAGATA 


CACATAGTTA 


GAAAAGACCG 


660 




ATTAAATGCA 


/"■IVJVf'lVJV'IVlV US* 

LAALAAAAAG 


TAAATACCTT 


AAGTGAAGGT 


CATCAAAAAC 


CGTTTAATAA 


720 




CCCAATCAAT 


AAAGT ANC C A 


TGCCAATAAT 


ATTAATGCAT 


GGCTGCAAAG 


CAAATAATGA 


780 




GTTTGTCGTA 


AAAA1ALLAA 


C ATTT AAAC T 


AGCAATAAAT 


AATATCCAAG 


TCATCATTTC 


840 


25 


ATTGATGCAA 


1 \_ 1 AG 1 AT AG 


TCCACATTCT 


AAACAGGTGT 


GGACTATTAC 


TTTTTTCACT 


900 




TTATATTACC 


uAAAAAAl IA 


TT ATGC TTAA 


CTATCAATAT 


CAATAATTAA 


TTTTAAGCTG 


960 




AAAAACAATA 


AAAA rGTTAA 


GACAACGTTT 


ACTTCAAGTT 


AATTATTATA 


CTGAAAATTC 


1020 




TGGTATATAA 


TGCTGTTAGT 


GAATATAACA 


GGGAAATTAT 


ATTGGTTATA 


ATATTGAGTC 


1080 


30 


TATATAAAGG 


AGAAATAACA 


GATGAAAAAG 


AAATTATTAG 


TTTTAAC TAT 


GAGCACGCTA 


1140 




TTTGCTACAC 


AACTTATCAA 


TTCAAATCAC 


GCTAAAGCAT 


CAGTGACAGA 


GAGTGTTGAC 


1200 




ACAAAATTTG 


TAGTTCCAGA 


ATCAGGAATT 


AATAAAATTA 


TTCCAGCTTA 


CGATGAATTT 


1260 




AAGAATTCGC 


CAAAAGTAAA 


TGTTAGTAAT 


TT AAC TG AC A 


ATAAAAACTT 


TGTAGTTTCT 


1320 


35 


GAAGACAAAT 


TGAATAAGAT 


TGTAGATTCA 


TCGGCAGCTA 


GTAAAATTGT 


AGATAAAAAC 


1380 




TTTGCCGTAC 


C AG AATC AAA 


GTTAGGAAAC 


ATTGTACCAG 


AGTACAAAGA 


AATCAATAAT 


1440 




CGCGTGAATG 


TAGCAACAAA 


CAATCCAGCT 


TCACAACAAG 


TTGATAAGCA 


TTTTGTTGCT 


1500 




AAAGGCCCAG 


AAGTAAATAG 


ATTTATTACG 


CAAAACAAAG 


TAAACCACCA 


CTTCATTACT 


1560 


40 


ACGCAAACCC 


ACTACAAGAA 


AGTTATTACT 


TCATNCAAAA 


TCAACACATG 


TNCATTAACA 


1620 




TGTNNATCAT 


GCAAAAGGAT 


TCTTTTAATA 


NACACTTTAT 


TGTTACACAT 


CAGACTCGCC 


1680 




TAGATATACA 


CATCCATCTC 


AATC TTT ATT 


ATCAAGCATC 


ATGTGCAGCT 


CCTGGATATC 


1740 




ACGCGCATAA 


TTTGGTACAC 


AGGGCATGCT 


AGCATTAA 






1778 



45 

(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 1568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

5 

GTGCTTTTGT TCAAGCTTCT GATCAATTTT TTATCGTCTT TTGTAATTTC GCGAATGTCT 60 

TCAAACATTG ATAAGACAAT CTGACCCACA TTTTGTAATT CTTTTTGAGT TTCTTGTAAT 120 

GCAACACCAG GTGCGTGATA AACAAGATCT TTGTGTAAGT GCTGAGGNNT ATAGTCANCA 180 

1Q GCNATATCTT TACCTGGGAC AAGCTTTG T A NCTATCCATG CTAAACCTGC TACANATGGT 240 

AATTGAATCA NAGTATTTGT TATGTTGAAG ATACCATGTG ATACTGCAAT CGTCATCGCT 300 

GGGTTTAAGT GCCATACATC TTGTAACANA CTAATCAAAT GAATC AC AAC TGGCAAGAAA 360 

ATTGTGNAGA TAATTACCCC GANTAAATTA AAGATGACGT GTACAAGCGC CGCACGTTTT 420 

15 GCAGCGATTG AN CCGGCTAA ACT AGC T AAG ATAGCTGGAA TTGTGGGACG AGACAATGTT 480 

ATCACCTAGG AACACAGGGA TTGCTGCGTT TNAGCTGATT AAATCTTGTT GATAAAATTC 540 

TTGTAAAATA CCAATCGNCG GACTTGAANT GTTGANCTAG TGCTGTTACC CCTGCGCCGA 600 

CAATGACACC AAGTATTGGA TGTGATGACA ATATCAAGCA TTAATTGNTT AAAATCCATC 660 

zo TAATGATGCT AAGGGGTTTA NCGGCATCCA CCCCATAAAT TCTAAGACCG AAGAAAAGAG 720 

ACCCCGAACC CGATAGTATG CGGCCAATGT TATTGATTTT AAGAGCGTTT AAAAGAAAAA 7 80 

GATTAAAAAT GCACCTAATG CTAAAATTGG CATTTGCATA TTCGCCTAAA TCTAATNCCG 840 

ATAATAAATG CAGTTACCGT TGTTCCGATA TTGGCACCCA TTATCACTCC AATAGCTTGT 900 

25 TTTAACGTCA TAAATCCAGC TGTTACCAGT CCGATTGTGA TAACGGTCGT ACCTGAACTA 960 

C TTTGTATT A AAATAGTTAC AACGATACCT GCAATAACAC CTAATACTGG ATTTGATGTA 102 0 

AATTTGTTTA AAATATCTCG TAGCCTGTCT CC TGCTGATG CTTGAAGCCC GTCTCCCATG 1080 

ATTTTTAAGC CGTAAAGGAA AATACC TAAA CCACCTAAAA AGGAGAAAAT GACTTCTGTA 1140 

ACCGACATTT CCATTATTTT CACCTCAAAT AAGC TTT AT A TTTAGATTAT CGCTTATAAT 1200 

TGTAAATTTA ATGTTAAGAT TAGGTAAAAT TATTTAACAA TATATGTTAT TTGTANATGA 12 60 

CTTGTAAAAT ATCGTCACTT ATTATGTNAA TTTTCAGTGT GAAATGGCAG GTNTGCAATA 1320 

ACGTGTTTAA CAAAATGATG CAATCAATCA TGTAATTATG TTTCATCAAA AAAATCATGT 1380 

GAGTGGGATA ACGAAATAAA G TTTGTG AAC ATATCATTTC TATCCCACTC CATGATTTGA 1440 

AATCACCAAA TAAAAATCTA TTAATGGTTT TCGTTATAAC AATTTGTGTT CTTTTAATAA 1500 

TGTCTCAATG TACGTACCTT TTATCTTTTT AAGGAATCCT GCTAATGCGA GTTTCTGCAT 1560 

TTTCGAAT 1568 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 
4$ (A) LENGTH: 1317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
ID) TOPOLOGY: linear 

so 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96; 

55 

ACCCCATTGA TTCCTGAATC CCCTGAAAGA AGTTGAATCA CCTTGAAGAT CCTGATTCCN 60 
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CCTGTACTTT 


GTGAACCCAC 


TGAAACTTAT 


TGAAGAAGAA 


TCCCCTGANC 


CTGTCTGATG 


120 




TTGATAATGA 


TGCCGACACC 


GATGTGCTTT 


GTGATGCCGA 


TGTACTAGCA 


CTCATTGACA 


ISO 


5 


TTGATGTTGA 


TATCGATGTA 


CTTAAGGAAC 


C AGATGC AC T 


TGTACTTGTT 


GACTGCCTTG 


240 




TGACATTGAA 


TCACNTAATG 


ATGTAGATGT 


GCTTGTTGAG 


CTCGAGTCAC 


TTACACTTGT 


300 




TGAACCTGAT 


ATTGAGTCAC 


TTAAACTTGT 


CGATGTTGAA 


ACTGATACGC 


TTCCGCTCAT 


360 




TGAGTCAGAT 


GTTGAAAGTG 


ATGTACTCGT 


TGAATTTGAT 


CCACTGATGT 


TAGACGAATC 


420 


10 


ACTTGTAGAC 


ATTGAGTCGC 


TTT CTGATGC 


ACTGATGCTC 


ATAGAGTCAA 


ATTGAC T ATT 


430 




ACTTG TTGAG 


CTTGACTGCG 


AATCGCTCAC 


ACTTGTTGAC 


GTTGAT TCTG 


ATCCACTCAA 


540 




ACTTTGCGAG 


CTACTCAATG 


ATTTTGAATC 


ACTTAATGAA 


TCCGAAGTGC 


TAAGACTTGT 


600 




GGAACCACTT 


AAAGATATTG 


ATCCACTTAA 


TGAGTCGGAG 


TC AC TTGT AC 


TAGTAGAATC 


660 


15 


ACTCATTGAT 


ATTGAATCAC 


TTAGCGAGGT 


AGACTCGCTT 


ACGCTTTCTG 


AACCACTTAA 


720 




TGATGTTGAG 


GTACTCAATG 


AACCAGATGT 


ACTTGTTGAA 


GTCGAACCAC 


TTGTTGATTT 


780 




TGAATCACTT 


AATGAATCAG 


ATTCACTCAC 


GCTTTCTGAA 


CTTCTTAGTG 


ACGTCGATAC 


840 




ACTTAATGAT 


GACGAATCGC 


TTGTGC TT AC 


TGAATCGCTC 


ATCGATTGTG 


AGCCACTCAA 


900 


20 


TGAACTTGAC 


TCGCTTACAC 


TTTCTGATTT 


TCTTAATGAC 


GTTGAGACGC 


TCAATGAGCC 


960 




AGAATCACTG 


ACACTTGTTG 


AGCCACTCAT 


CGATTTAGAG 


TC AC TTTC AG 


AATTAGATTC 


1020 




AC TT AC AC TT 


TC TGAATC AT 


TTACAGATTC 


TGACATACTT 


TG TGAATC AG 


ATATGCTTGC 


1080 




GCTCATTACT 


TCACTAGCCG 


ATGTTGATGT 


ACTTGTCGAA 


TCACTTAACG 


ATATAGATAC 


1140 


25 


AC TCATCGAA 


CCAGATGTAC 


TCGCAC TTGT 


TGAGTCTGAT 


GTTGAATCAC 


TCACACTATC 


1200 




AGATAATGAC 


GTTGAATCAC 


TCATACTTGT 


TGATGTACTT 


GTTGAAAGCG 


ACATACTTTG 


1260 




TGAATCACTA 


GTACTTGTAC 


GCATCGAAGT 


ACTAGTTGAC 


AGCTGATGTC 


TCGTGCC 


1317 



(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



45 


CCCAAATTGG 


CGATAGCTAA 


TATTTGATAA 


TGATGCGCAT 


TTCATACAAT 


TTGTGACTTG 


60 




GCAATTATTG 


AATATTTATA 


TAATTTTTTC 


TCGTAATAAA 


CACAACTTTG 


ATAGCGCTAA 


120 




AACAGCTGTG 


TTAAGTTATG 


AGTTAACGAA 


TTTAACACAT 


TTTACTAGGG 


CGGCATTTAA 


180 




GAATATTTAT 


ACGTTTTTAN 


CGAATATTTA 


TTTATTGTAA 


NACGCTACCA 


AAAAGTTAGA 


240 


50 


CTTCCTTCCC 


ACTAAAGTNC 


CACTTTTTTC 


TTTC AAC TTT 


TTTAANAAAC 


GGATATGCAA 


300 




CTTTTAGTAT 


TGGTATCAAA 


ATGATTGTTA 


GGTCATATTC 


TATCAATATA 


TTTTTCTAAA 


360 




GAATTGC TTT 


TATTAACTTT 


CAATTATGTA 


CCTAACCTAA 


AAAGAAGC C A 


AGGCAACGAA 


420 




TGTTACCTTG 


ACTTCTAATA 


CATATTCAAC 


TAACTATATA 


TTCAATCATA 


CGCGCATGCG 


480 


55 


AGAGTGATTG 


TTGTACATCT 


ATAATGCGTT 


GATTTAAAGA 


ACCTTTATAT 


GGTAAATCAG 


540 




GTTTGAATAA 


GTGTTGTATA 


AATAGACCAT 


CGACTAAAAC 


GTCAATGTAT 


GATAATAACT 


600 
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L 1 LbALU 1 1\, 


TVP RPR !i TP & 


X X 1\jV- X rtrto X 


ATTPATATAA 


AAATPPAGTP 


PATACCCAAA 

«^ i- L,rvv^ 


660 




TTGTC TTTGT 


^ T'T'TV/** A A ?i R 
Ail ItLAAAA 


LL? lot 1 LuiUi 


A TPP TTTfi A P 
A l\>v, XXX L3/\L 


AAPATTTAAT 

AAuA XXX e\r\ X 


fiTAATATPPA 
vj x x n x v^. v-/i 


720 




A B rrirp* A A A A 


XLAj 1 1 L-LtLLA 


LL X Art 1 nuAL 


TTAPPPPAPA 
X lAuLLV-rtuA 


TAT ATA ATP A 
inini aa ivA 


TPATPGPAAT 
X \ x v.vjv.rtn x 


780 




CA ICTAATAT 


J. XL 1 vjL 1 AA X 


A X X X LA X LAO 


TPT A TTTPTH 
X L» X A X X iv- 1 L 


PPP AT ATP TP 
VjLLAXAIL XVj 


A A P TTTTPTC 

/VrtL- X X X lulu 


840 




ACAjL TTTGTT 


A T> A ^ A TVf* A 

A 1 AALA X LL A 


a P a r» a a tt a a 

AL ALAA 1 Irtrt 


ATPP APATPP 
A X L*LxALA ILL 


TP A TAP AT A A 
X\jA X AL A 1 AA 


APAPTHPATP 
AL rtv, X ULn X V— 


900 




TTAlTClTTL 


ACCva i LAAtA 


AAvjL 1AI I XLj 


AA1LL X A X X X 


TAflPA AT AT A 


ALL X XL3 X LL X 




1fi 
1U 


TGTTTAATGT 


/-</-» rp Ji TV JV JiPW* 

CL TAAAAO I L* 


X LAiLL I I X A 


UUL.UV.L X X LA 


T A TTTT A P 
lAlul X X xAV_ 


TPPTPPnP A A 
X L-L? X ULvTLnrt 


1020 




ATTTCTTTAT 


(jAUWiLL ill 


A A T"T> a fTClTZ A 
X X AL i vjrVjM 


rvTTf^A A PTP 

L.U 1 1 vj/vrt v. 1 u 


PATTPPPTAA 


PTAAPPPAPA 


1080 




lul 1 LL? 1 i 1 A 


arrara tv a a 


PTPTTTTAPP 
L 1 \j X X X X n\j\j 


A TT ATP A TTP 

AX 1AILA1 X w 


WPACAAGTTC 

i « v» rv/\vj x x v_ 


PGGCATTTAA 


1140 




ATCCTTTTTC 


ALsiN X bL I 1 LA 


A A ITfTrPlT 1 


PPPWNJATP AP 
L L VjXMI>J,A. X LAL 


ATTPAATAAP 

rt X X V- X -TT-TiVw 


AATPPA1MPAA 


1200 


15 


A X CWjIvA X A X 


TTPY1NTMPPT 21 
X X \ jL»X>JX>JLL X A 


AVjL? X X nrtL 


APTTMPPP A A 


TAAGAGTNGG 


PPPPAATAPP 


1260 




pptvjttvt a pp. 

LOXN 1 i\, X AVJO 


PPTTTPA AAT 
uL 111 LAnn 1 




TTTPPPATAP 

XXX V^, V_TV_7i» X rt^. 


TC AC AAT AG T 


GAATGAAACC 


1320 




n\_V-nV_ XLLyLrV 


T 7v ATA APPAT 


A ATP TTTTTC 

Art IV. 1 X IX Iv. 


AAAATPTAAC 
Art/vn lv, i ***fcV^ 


TTTTCAAAAG 


GTGTAACATC 


1380 




111 ALL* X nvn 


TP A T A & TPP A 


J^~tWjlJ-\. X X X -L VJ 


AT AAT ATPP T 


TT A TP TGT AA 


TGTC TTT AAT 

X *J X W xxx x 


1440 


20 


r\ 1 L 1 


V— 1 V, X \_ X X 0\J 1 


PTAAACGAPA 


AAAACGATCC 


GTTAGCGANN 


CACTCGGCGT 


1500 




AC TCGTAAAT 


TV /Trv"» TV 1V.TV"*/** TV T4 

ACTGANCCAA 


tv TV"* TV"* AT* A T* A 
A X O 1\- A 1 A X A 


AXIL l\jl\lLLA 


TTMriPfTJTPTP A 
X JNLjL Vj X V3 IVjA 


T A A PPTMTP A 
X AALu X XX X LA 


1 JDU 




TTTCTTTAAG 


AATNACAAGC 


GTAAATGCTT 


GNGCTTCTNG 


AGATGTCTCC 


CAGTCTGGAC 


1620 




CATAGAAAAC 


AGNAGCTGTN 


TCATACAACC 


CTATCTAGCC 


CATTGAAATC 


GTTGCACGTT 


1680 


25 


TATTTTTAAA 


TAACTCAGCA 


ACATCATCTG 


TTTC TTTTAA 


TTTATAGTTA 


AAAGCGCCAC 


1740 




TTTTATATAA 


AATTGGTGCG 


TTATTCGGTA 


CAGNATCTTT 


CAAACGATTT 


ATACGATAAA 


1800 




GTAATGCATC 


ATGTAACACA 


TCGATACGTT 


CATAAAAGAT 


TTCCCAGAAT 


TTCGTCATAT 


1860 




TACCGGCAGA 


TTCTAATGCC 


ATTCTAGGTA 


AATTAAGTGT 


NACAACACCA 


AGATTACAAC 


1920 


30 


GACCATTATT 


TTCAAAATGA 


CCTTCCGCAT 


CTTTCCAACT 


TGGGTAAAAA 


TGAACGACAA 


1980 




CCCATTGGNG 


CTTTGAAATC 


ACCTAATATT 


TCTACGAGTT 


TGGTCAATAA 


TTTAAAATAT 


2040 




CTGGATACAT 


ACGTTTCGGN 


GGAACAACTT 


TAATGCTAGT 


TGTTTAATGT 


CATAGGTCGG 


2100 




ATCTTGGGGG 


AC T AAAGTTG 


GGGTCC CNTC 


TCTATTGGAA 


ATCAAA 




2146 



35 

(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1015 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
(D> TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

50 



ATTGTC C C AC 


ACCATTTACT 


ACAAGANACG 


AGCGTCCTCC 


ACATTGTTAC 


NTTGTGCNAT 


60 


CAATNCTTCT 


NGTTTGTTTT 


GAANNNAAGA 


CCGAAGCATT 


TGTANTCTGC 


TGTGTTCGCA 


120 


GCCTTGTTTC 


GCACGTTCTA 


AGGTATTGAA 


TACCGTTTAA 


TGCAGTATTG 


GCTTGTGTAA 


180 


CAGCTTGCAT 


TGCTCTATCG 


ACATCTGCNT 


TAGG TGTATT 


TCCTCCAGCT 


GTTTTATNTA 


240 


AAATAGTTGC 


TGCTGCATTT 


AC TGC TTG AG 


AATAAGCCGT 


TCGTTTAGCA 


TCATCAGCAT 


300 
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CTTGATAATT 


TTGAPTTTVJT 
x vjrt \_ x x x \j x 


2x JxC^CZTCCZTCT 




AA i fGAl GTT 


TCTAATTGAC 


360 




CCATAGCACC 


ATCTAATTGT 


ILjL LjL, 111 Lx\j 


^1 1 IrvlL. I Vj> X 


Al 1 AALAL v_ 1 


L AAC ATTTGT 


420 


5 


TGCTNG TGT A 


ATTTCATTAT 


CTAACGCATT 


ACGTTGTGCA 


TTATTAATGT 


GTGTTAATGT 


480 




ACCTAACGTT 


TGTTTCGCAG 


CAGCTTTAGC 


TTCATTTAAT 


TTCGCATCAC 


CGCNCAACGC 


540 




CGTCTTCGTA 


CTGTNCACAT 


TGTGTAATGC 


TTGNNCAACT 


GCTGCTTTGT 


CTACATTGTG 


600 




AC C ACT AGC T 


TTTGT TAAAA 


TTGCTTTTGC 


TGCATTTACT 


GCTTGATCAT 


AAGCTGATTT 


660 


10 


CTT AC TTGGC 


TCAGCATCTA 


GGTATTTCTG 


AGTTTGTTTT 


GGTTGTGTCT 


CATCATTGAT 


720 




AC C ATTTTGT 


AAACTTGTGC 


ATTGCGTTAT 


TTAATTCTTG 


TTGCTTTNGC 


AGGTTC TTGG 


780 




ATTTACACCA 


GCTACTGTAG 


GTGGNACCAT 


CAATGTTAAG 


CGTTAACGCA 


TCTTTTTGTG 


840 




CATTGTTAAT 


TGATGTTAAG 


TTATTCAAGT 


TTGTTTTTCG 


CTGTTGTCTT 


AGCTTGAGCT 


900 


15 


AAGTTTTGGC 


GCACCATTTA 


ATGCATGTTC 


TTTAGTNGGT 


CACTTGTGAT 


GTTGCTTGGC 


960 




GTAATAGTAT 


TNGGGNTCCA 


TCCNNTGGNT 


TCCACGTTTG 


GATTAATGAT 


TGCTT 


1015 



(2) INFORMATION FOR SEQ ID NO: 99: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1550 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


99: 








GGCACCAGCT 


AGAGCGAGTT 


TTTATATATA 


CAACACGAAA 


GAAGACATTG 


ATC AG TTAAT 


60 


35 


AAATGCCTTG 


AAACAAACGA 


AGGAGTTTTT 


CTCTTATGAA TTTTAATAAT CTAGATCAAT 


120 




TATATAGATC 


TGTCATTATG 


GATCATTATA 


AAAATCCTAC 


AAATAAAGGT 


GTATTANATA 


180 




AC GGGTCTAT 


GACAGTAGAT 


ATGAATAACC 


CGACATGCGG 


TGACCGTATA 


CGACTAACAT 


240 




TTGATATAGA 


AGACGGCATT 


ATAAAAGATG 


CTAAGTTTGA 


AGGTGAAGGT 


TGTTCGATTT 


300 


40 


CAATGGCAAG 


TGCATCGATG 


ATGACACAAG 


CTGTTAAAGG 


GAATCCANTT 


GGAGAAGCAA 


360 




TCCAAATNGA 


CCCAAGGAAT 


TTACCGNAAA 


TGATGCTTGG 


TTGGAAGCCT 


ANTGTGATNC 


420 




AGGGAAGAAT 


GGGGAGATAT 


TGAAGCATTC 


CNGGGTTGAT 


CTCAATCCCA 


GCTCGTATTA 


480 




AATTTGTCCA 


C ATACC TTGG 


AAACATTGGA 


AAAAGGGTCC 


TTGTTCCTAA 


AGGAGGGTAA 


540 


45 


AACAGGAGGG 


TCCGGCTTGA 


AGAAGAAAAG 


ATCC TGTTAA 


TCATAAGATG 


ATTTTGATAT 


600 




TAAGACATAT 


NNAAGTATNN 


NAATTTTTAA 


TAAAGATGTC 


ATGTC ATTGT 


AATAAATATG 


660 




GTTTACATCA 


TTGAATTAAA 


AACTTACGCA 


CCGCCGTTGT 


AAATATATTT 


TTAAGGAGTG 


720 




ATTGAAATGG 


CTAAAAAAGC 


ACC TGATGTT 


GGGGATTATA AATATGGATT 


CCCCGCCGAT 


780 


SO 


GATGTATCCA 


TTTTCAGATC 


AGAACGTGGT 


TTAACTGAGA 


ATATCGTTAG 


AGAAATTTCT 


840 




AACATGAAAA 


ATGAGCCGGA 


ATGGATGTTA 


GATTTCCGTC 


TTAAATCATT 


AAAATTGTTT 


900 




TATAAAATGC 


CAATGCCTCA 


ATGGGGTGGC 


GACTTATCAG 


AATTGAATTT 


CGATGACATT 


960 




AC TT ACT ATG 


TAAAGCCTTC 


AGAACAAGCT 


GAACGTTCAT 


GGGATGAAGT 


GCCAGAAGAA 


1020 


55 


ATTAAAAGAA 


CTTTCGATAA 


ATTAGGAATT 


CCTGAAGCTG 


AACAAAAATA 


TTTAGCTGGT 


1080 




GTTTCTGCTC 


AATATGAATC 


TGAAGTTGTT 


TACCATAATA 


TGGAAAAAGA 


ACTTGAAGAA 


1140 
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AAAGGTATTA TCTTTAAAGA TACAGATAGT GCTTTACAAG AAAATGAAGA ATTATTCAAA 1200 

AAATACTTTG CTTCTGTAGT ACCTGC AGC A GATAACAAAT TTGCGGCGTT AAACTCAGCA 12 60 

GTATGGTCAG GTGGNTCGCT CATTTATGTA CCTAAAAATA TCAAACTAGA TACGCCACTA 1320 

CAAGCTTATT TCCGTATTAA CTCTGAGAAC ATGGGTCAAT TTGAACGTAC ATTAATCATT 13 80 

GCTGATGAAG GTGCTTCTGT ACATTACGTA GAAGG TTGGT ACTGCACCAG TTTATACAAC 1440 

TAGNTCTTTA CACTCTGCTG TTGG TGGNAA TCATTGGGCA TAAAGATGCG CACNGTCCGC 1500 

NTTATTC TT A CGAANCAAAA C TGCGGG AC A ATGTTTTCAA CTNNAGGTAC 15 50 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



25 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:! 


100: 








TGATAATACT 


GCTTAATACA 


ACATTTATTG 


CAATTAAAAG 


TGCAGTAATA 


GCCAGTTTTT 


60 




CNTTGANTTC 


AAAATGANTG 


TCCTCCNTTT 


TTGTTTCTAA 


NTAATCACTA 


TGCTTGGCTT 


120 


30 


TATTATGGTC 


ANNTAAACGT 


GTTTCCATTT 


GTTGATACTA 


ACATTTTCAA 


TAATTGATTC 


180 




GCTTCATATT 


GTGAAGTTTG 


AAAC TGTTC A 


NCTATGGGCA 


ATGTATTTAT 


TTCTGCTTCT 


240 




ATACTTTGAN 


TGGTATGTTC 


CGACTGTTCT 


AGCGCATTTT 


GTTTCCCGTA 


ATTTTGAAAG 


300 




TTTACTGCTT 


GTTTTTGATG 


CTTTTTTAAC 


ATATCCATTT 


TAGTCTTTAT 


CGTTTGGTTC 


360 


35 


TGATGAATCT 


GTGCTTCAAT 


TTGTTGATAT 


GTTTTGATAG 


AANCCCAAAT 


TTTAATTTTA 


420 




NTTGCAATAT 


AACCCGCTTG 


TTCCAAANCG 


TCATCTTTAT 


AAAACAATTT 


ATTTGAATCA 


480 




CCNATTTCGG 


GCTCCTACTT 


CCTTTACCAA 


ACTTGCCATT 


TAATGGAATA 


CTGTTTAGCT 


540 




TCATCTATTC 


GTACTTCCAC 


TAGTTTACCC 


AATCATTTCT 


TTAGGTGCTT 


TGAAATTAAC 


600 


40 


TAGCTTATTT 


TTATCAGTGT 


AGCCAGCAAG 


AACC TGATC A 


TCTTTTTTAC 


TACTACCTTC 


660 




ACAAAGTACT 


GTTACAGTTT 


GTCC TTCGTA 


CTTAC TC AT A 


GCTATTTGTG 


AATAATGACC 


720 




AACTTTTTTA 


TTCAAACGTT 


GCAATCGTTC 


C TTTTTGAC A 


TTTAAAGGTA 


CATTATCTTT 


780 




CATTTTAGCA 


GCAGGCGTAC 


CATC AC GTTG 


TGAATACAAG 


TACGTATATG 


CATGTTCAAA 


840 


45 


ACCAACTTCA 


TCATACAGAG 


TTAAAGTTTC 


TTCAAATTGT 


TCCTCTGATT 


CATTTGGATA 


900 




CCCTACAATA 


ATATCTGTAG 


TTAATGCTAC 


ATTAGGAAGT 


CTATCTTTGA 


TTCGTTTTAC 


960 




TAAATCCAAA 


TAACTTTCTC 


GTGTATATTT 


TCTACCCATT 


ATTTTTAATA 


CTGCATTATT 


1020 




TCCAGATTGA 


ACTGGCAAGT 


GGATATGAGG 


AACGATATTA 


CCACCCTCTG 


AAATAACATC 


1080 


SO 


AATCATGTGA 


TCTGTAAAGT 


CCCAAGGATG 


ACTTGTTGTG 


AAACGAACTC 


TTGGAATCGC 


1140 




TATTTTAGAA 


ATTGCTTGTA 


AAAGATCTCC 


TAAGTCATAT 


TCTATATCCT 


GTAAATCTTT 


1200 




ACCATAAGAA 


TTTACATTGT 


GACCTAAAAG 


CGTTATTTCT 


TTGTAACCTT 


CACGAGCAAG 


1260 




TTCACGTACT 


TCATCTATAA 


TGTCTTCAGG 


TCTACGGGTC 


CGGTCCTTAC 


TCCTTGTNAA 


1320 


55 


TGGAACAATA 


CAATATGTAC 


AAAACTTATC 


ACAACCATAC 


ATAATATTGA 


CCCATGCTTT 


1380 




ATGTTGCTTC 


ACGGACTTGT 


GGNAGATTAC 


AATAACGTCT 


NCTTCTTTTA 


GACATACTCA 


1440 
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10 



20 



25 



35 



CAACAATGT 1449 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

GAAAGTGTTT CAGAACGTGG AGCAAAAGGC TTCGGAAGTA GCGGAGTGTA AAGACATCTT 60 

AGATCGAGTT AAGGAGGTTT TGGGGAAGTG ACGCAATACT TAGTCACAAC ATTCAAAGAT 120 

TCAACAGGAC GACCACATGA ACATATTACT GTGGCTAGAG ATAATCAGAC GTTTACAGTT 180 

ATTGAGGCAG AGAGTAAAGA AGAAGCTGAG CGCAAATACG AGGCACAAGT TAAGATAAGG 2 40 

AGAGATGGAG ATGCCAAAGA AAACGGNAAC GATTGATGTA GATGAAAACT TATTAGTAGT 3 00 

AGCTAGTAAT GAAATATCAG AACTATTATA TGAATATGAC AGTGAGTTAA TCNCAGCTGG 3 60 

ATGAAGATGG CGATAATAGA GATATCGGAA GGAAAAAAGA GGACGCATTA AAACAAGCTA 420 

TACAAAATTT CTCGATAAAT TACATGGGGG TGTTTAGTG 459 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

GGCACGAGCT CATACATCGC TAGTAAGAAT AATAATCTTA GTATTTGTTT AGTCATTCCC 60 

CACACTCCCT TATATTTTCA AACAACTGAC TCACTTTAGC ATAATTCCAT CTCTCCTATC 120 

TTAACCTTGT GCCTCGTANT TGCGC TCAGC TTCTTCTTTA CTCTCTGCCT CAACAACTGT 180 

AAACGTCTGA TTATCTCTAG CAGTAGTAAA ATGTTCATGT GGTTGTCCTT GTTGAATCTT 240 

TGAATGTTGT GACTAAGTAT TGTGTCATTC CTCATAGCTC CCTTGAACTT GTTTGAGCTT 300 

ACTCATAAAA AACATTACTA AAAATGCTAT TAAGATATGC GTCTTTTGAT GTTTATAAGC 360 

AAATGTAGAT ATCATAAAGA TAGTAGCAAG CATTAACATT TCATATATGT TTGTGTGTAT 420 

AGTCTTTTTA CTCTTAAGAA AAATAATTGC TATGCGATAA AAGAGATAAA CGCCAAACCC 480 

TATTAAAAAT ATTTCTAACA TGTCGCTCAC TTCCCCAAAA CCTCCTTGAC TCGATCTAAG 540 
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ATGTCTTTAT 


ACTCCGCTAC 


TTCCGAAGCC 


TTTTGCTCCA 


CGTTCTGAAA 


CACACTCGAA 


600 




TTCCTCCACT 


TGCTTTAGTT 


CAGGTGTCCA 


TATAGGCACG 


ATAACCAATT 


GAGCTAGTTT 


660 


5 


GTCGCCTTCG 


TTGATTTGAT 


AAGTTCCATA 


TTGTCTTATG 


GCGTCACTCA 


AATCGATTTC 


720 




TCCTTTAATA 


TC AAAAAC AC 


CTGGTGTGAT 


ATAACCATTC 


GATGCAATAG 


CGTCATTCTT 


780 




GATATTAATC 


CCTAAATTGC 


CGTGATATCC 


CGCGTCTATC 


TTGCCTGTTT 


CAATCACTAA 


840 




ATGCGTTTTA 


CTACTTACAC 


CACTACGACT 


AGTTAATAGT 


CCGACATAGC 


CCTCTGGTAT 


900 


10 


ACTCACAGCT 


ACATCTGTTT 


TGATCACTGC 


TTTTTCTTGT 


GGTTCGAGTA 


CGACAGTTTC 


960 




AGCTGAGAAT 


ATGTCATAAC 


CTGCATCCGT 


CTTATGATTT 


CGTTCGGGCA 


TTC TAGCATT 


1020 




TTTTGATAAT 


AGTTTTACTT 


GTAATGTGTT 


AGTCATTTTC 


CTATTCCTCC 


TCATATTTAT 


1080 




AGACAACTTG 


ACCTGCCATA 


ATCCCTACTG 


CTTCATCAAG 


TTCAATACCT 


CNTTTAACTG 


1140 


15 


AATGTTGAAT 


AGCATTTGTC 


ATTCCCTCAA 


GTATTTCATC 


AAACGCTTGC 


GC TTTCTT AT 


1200 




ACACGTCCTC 


AATCTC TTTT 


AGCAACCCCT 


CTGTGTCATT 


ACCGTAT AC G 


CACTAGCACT 


1260 




AATAACGGAC 


TGTTCGATTT 


TTTCGCGATT 


ATTCATTGGT 


GTCATCCTCC 


ATAAAAATTT 


1320 




TATTGTTTAA 


TTCCATTCCG 


AATTTAACTC 


TTTCATCATC 


GTTACCGAAT 


TCGTTTATTA 


1380 


20 


AATCTTTTTC 


AACGCTCTTG 


CAATACCTAT 


CCCATGCGCT 


TGCTTTCTTC 


TCCAGTTCTT 


1440 




TGTTACAATC 


TCGTAACTTC 


GCTATATCCC 


CAATAAGCTC 


ATC TCGTTGC 


TTCTTGTACT 


1500 




C TTC ACGATC 


TTTTAATGCT 


TTGTGAAGTT 


TATCTAATAA 


CTTGTTAGAG 


TTAGTACAAA 


1560 




GATTTTTATA 


TTGTTCATCT 


GATAAGGTGA 


ACGTCATCTC 


ATAACCTCCA 


ATAGCATCTC • 


1620 


25 


ATT TTCAAAA 


ATATTTCCAA 


CAATTTCAAT 


AATATCGGCA 


TT TTC AC TT A 


GTAATTCAGT 


1680 




TACATTGCTA 


AAAGTTATAT 


AAAAGGCTCC 


TTCTTTAAAC 


TCGATAAAAC 


TTACTTCTCT 


1740 




CGAATTAACA 


ATCTTGGAAC 


AATATCCCCT 


TCATAAATCT 


CCACAACCGG 


GCACATCTTT 


1800 




TAAATCCTGT 


GTATTGGTAA 


TAGGTTTTAC 


TTCAATTGAA 


ACTTTTATTA 


ACCTGGTGGA 


1860 


30 


AATCAAAATG 


TACCCACTTA 


TTAAAAATCG 


GATTTCGGNC 


AATAATACTC 


AATAACTTNN 


1920 




NNTATC TTNA 


TCCCAAGCTT 


TTAATTTCAA 


CATCAATCTT 


ACCAACTCCC 


CATCTTTCCA 


1980 




AATCAATGTC 


AACCGGCAAN 


GTCAC 








2005 



35 (2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 796 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 



50 


GGCACGAGCT 


GTTTTTTATT 


TGCTTCGTTA 


CAAAGCATTA 


TTGAATTTAT 


TTTACGTGTT 


60 




CATATTTTGA 


AACATCAAAG 


CCGTCTTGCT 


TAGCTTTGTT 


GAT AATGTC T 


TTGATTGAAT 


120 




GTAGTCCTTT 


ATCGGCGAAG 


TATGATCTTA 


AGTTGTCTTT 


TGTAGCTTGG 


TCAGCATTCT 


180 




TATCTAATAA 


CACATCGATA 


TAGCTTAATT 


CATGTTCTAA 


GAAGTTTGCG 


TCATCATGTA 


240 


55 


GTACGAGTCC 


ATTTTGAGAA 


TAAACTTTCG 


CATCTGCTTG 


ATTACCATAT 


CCAACAACGC 


300 




CAGTTGCTAA 


TACACCTACC 


ATTGCCGTAG 


CTACTAAAAC 


CTTTTTAAAT 


TTCATATCTA 


360 
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TCACTCCTCT 


AAAAATTGTA 


CTCTATCATA 


ACACTTGAAT 


ATTAAGAAAA 


TTACGGNTTA 


420 




TTAAGTCGGA 


CTTNANTAAT 


TCTTAATAAA 


TAGNTAAACT 


GACAAATATT 


NGCTNAAATG 


480 


5 


CAATTANTCT 


TNAAAACCGG 


GGTTTATGGA 


TTTTTCCTAC 


TAAAACC TTG 


ATTTCAAAAA 


540 




GGGTTTANCT 


CAAATGAAAC 


AATAATAAAA 


AATAATGCAA 


CATAATAATA 


AG T AC AAATT 


600 




TANTTAAGAA 


ATTAAATTGA 


TTGTATATGT 


ATATTNTGGT 


AACGTNAAAG 


AGAAATATNC 


660 




AANATAATTA 


ATTATTTATT 


TGAAAAGAGA 


ATATTAATGA 


AGTATTAAAC 


AAAGAGACGT 


720 


10 


GAAACGATGC 


GATATTTAAA 


AAGAC TTTC A 


TGGNACATAA 


GCATCTTAAT 


TTTAATAGGT 


780 




GGTATTGCTG 


GGTGGG 










796 




(2) INFORMATION FOR SEQ 


ID NO:104: 








15 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 1122 base pairs 










(B) 


TYPE: nucleic acid 










20 


(C; 


STRANDEDNESS : single 












\u) 


TOPOLOGY: 


linear 










25 


(ii) 


MOLECULE TYPE: Genomic DNA 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 104: 








AACTTTG AG T 


NTTTTGTTCN 


AT ATGGAAC T 


TTAAAGATTT 


GGAAATTATN 


TAGATGGAGA 


60 


30 


TATATCATAT 


AATCCAGAGG 


TGCCNAGTTA 


TTCAGCTAAA 


TATCAATTAN 


/•link » rrv**» 7v fYV~* v< 

CTAATGATGA 


12 U 




TNACAATGTA 


AAGCAATTAC 


GTAAAAGATA 


TGACATNCCA 


GCAGAATAAA 


GCACCCAAAG 


ion 
1 o U 




TTATTTGTTG 


AAAGGGTACA 


GGGTAATTTN 


AAAGCCTCAT 


CAGTTGGATA 


TAAAGACAGT 


*5 A f\ 

z 4 U 




GAATTTACTT 


TCGTAGAGAN 


NAAAGGTGAA 


AATATATACT 


TTAGTGATAG 


TCTACATCTT 


i n n 
iUU 


35 


GANCCAAGTG 


AGGATAAATA 


AACGTGACCA 


ATAAAGAGTA 


TGAAATCGAA 


CCCGGAAAAA 


jbU 




GAGAGTGTGA 


AATGATGAAA 


CGATTAAATA 


AATTAGTGTT 


AGGCATTAGT 


TTTCTGTTTT 


/on 




TAGTCATTAG 


TATCACTGCT 


GGTTGTGGCA 


TAGGTAAAGA 


AGCGGAAGTT 


ik a a a m/""T' 
AAbAAAAbL i 






TTGAAAAAAC 


ATTGAGTATG 


TACCCTATTA 


AAAATCTAGA 


GGATTTATAC 


GATAAGGAAG 


540 


40 


GCTATCGTGA 


TGATCAGTTT 


GATAAAAATG 


ATAAAGGTAC 


ATGGATTATA 


AATTCTGAAA 


600 




TGGTTATTCA 


ACCTAATAAT 


GAAGATATGG 


TAGCTAAAGG 


CATGGTTC T A 


TATATCGAAT 


660 




AGAAATACCA 


AAACAACAAA 


TGGTTACTAC 


TATGTCGATG 


TGACTAAGGA 


CGAGGATGAA 


720 




GGAAAACCGC 


ACGACAATGA 


AAAAAGATAT 


CCGGTTAAAA 


TGGTCCGATA 


ATAAAATCAT 


780 


45 


TCCAACAAAA 


GAAATTANAG 


ATAANAACAT 


AAAAAAAGAA 


NTCGAAAACT 


TTAAGTTCTT 


840 




TGTTCAATAT 


GGAAACTTTA 


AAGATTTGTC 


GAAGTACAAA 


GATGGAGATA 


TTTCATACAA 


900 




TCCAGAGGTG 


CCAAGTTATT 


CACCAAAATA 


TCAAGTAACT 


AATGATGACT 


ATAATGTAAN 


960 




ACAATTNCGN 


AAAAGATATA 


GATATACCGA 


CCGAATAAAG 


CACCANAGCT 


ATTGTTGANA 


1020 


SO 


GGTACAGGGA 


ATTTAAAAGG 


TTCATCAATT 


GGCTACACAA 


AAATTGAATT 


TACTTTCGTA 


1080 




GAGAAAAAGG 


GAGAAAATAT 


ATACTTTAGT 


G ATGGGC T AC 


AG 




1122 



(2) INFORMATION FOR SEQ ID NO: 105: 

55 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 







CTATTCCCGT 


TTTGCCATCT 


CNAATTTCTT 


ACCATGGCCC 


ACAAAGTATT 


60 


15 


GTNACGTTTG 


AGCTGATCCT 


TATTTAATTT 


TNAAATTGAG 


AAAAATGTCT 


CCAAATCCTG 


120 




TACCTNCCCC 


ATATTCCCGC 


AATACAAATA 


AAATTTTGTC 


GTATTTGCTC 


NCCGGTAAGC 


180 




TTTAAATGTG 


GTCATTATAA 


GANCGGATTG 


TCTTGTAATT 


GACCGCATGT 


CATACCAGTT 


240 




AGGAATCACA 


TGGATATTGT 


CAGCATTTTT 


AAGAAATTTG 


ATGATTTAGT 


AAGTAGTTTT 


300 


20 


TCATTTCCGT 


ACCCAAGGAC 


AATGACATTT 


TCAGCATTCT 


TGTAGACATG 


TCTATTAATG 


360 




TAACGCATCA 


GCTTATCAAT 


CATGCTACCT 


GGACGAGTTG 


CACCTGTCTT 


AATCGCATTA 


420 




TCAGGTGCTA 


TATCATACAC 


CACAAAAGAA 


TATTTTTTCT 


TAAGCAGTCT 


GTGTAAAACG 


480 




TCTGGTATTA 


ATGGCAAGAT 


TGGTGGATTA 


GAGTAAACAA 


GAATCTGATC 


ATATTTCAAC 


540 


25 


ATTTTAGGTA 


TATTAATCAC 


GAATTTTGAA 


AATAAACTAA 


AGAAATTGAT 


GATCCTTCCA 


600 




ACCTTACTTT 


TGTTATTAAA 


CCTCGAATAC 


TTGAGACGTC 


GAATGCGAAN 


TC 


652 



(2) INFORMATION FOR SEQ ID NO: 106: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 





GAATTC CCTT 


CAAAAGCACC 


TTTGATACCG 


AATAAATTAT 


GGTTTGGTGA 


CTTAGATAAA 


60 


45 


GAACTAGCGG 


GAAGATTTTA 


ATCGCAAGAC 


TGGTGCGAAT 


GTGAAGTATA 


TTGAAGCACC 


120 




TTATGAACCG 


CATAAGTTTG 


TGAAAATGGT 


GAAGGATAAA 


GAATTAGCTG 


ATGAAAAAGA 


180 




AGGCGGCTTA 


CGTNGTACCG 


CTTGTTTTGA 


AATGCGTTTG 


GATATTGTAG 


CGAAAGCAGC 


240 




TGTAGAACAT 


GGCTATGATT 


ATTTTGGCAG 


TGCAATCACG 


TTATCACCTA 


AAAAGAACGC 


300 


50 


ACAATTAATC 


AATGAACTTG 


GTATGGATTG 


TCCAAAAAAT 


ATACGATGTG 


AACTTATTTG 


360 




CAAGTGATTT 


TAAGAAAACT 


AAGGTATTGA 


GC 






392 



(2) INFORMATION FOR SEQ ID NO: 107: 

55 

(i) SEQUENCE CHARACTERISTICS: 



96 



EP 0 841 394 A2 



(A) LENGTH: 3797 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


107: 








TCAAATGCAG 


TCAGGGAAGC 


AATAGGACGA 


TATGCATAAA 


GGAGATGGTA 


AAGTGGAACA 


60 


15 


GTGACAGAAG 


GTAAAGACAC 


GCTTCAATCA 


TCGGAGNCAT 


CAATCAANCA 


CAAAATAGTA 


120 




AAACAATCAG 


GAACGCAAAA 


TGATAATCAA 


GTAAAGCAAG 


ATTCTGGAAC 


GACAAGGTTC 


180 




TAAACAGTCA 


CACCAAAATA 


ATGCGACTAA 


TAATACTGAA 


CGTCAAAATG 


ATCAGGTTCA 


240 




AAATACCCAT 


CATGCTGAAC 


GTAATGGATC 


ACAATCGACA 


ACGTCACAAT 


CGAATGATGT 


300 


20 


TGATAAATCA 


CAACCATCCA 


TTCCGGCACA 


AAAGGTATTA 


CCCAATCATG 


ATAAAGCAGC 


360 




ACCAACTTCA 


ACTACACCCC 


CGTCTAATGA 


TAAAACTGCA 


CCTAAATCAA 


CAAAAGCACA 


420 




AGATGCAACC 


A A /~> A A Ik /■-» 

ACGGACAAAC 


ATCCAAATCA 


ACAAGATACA 


CATCAACCCG 


CGTGCCTCAA 


480 




ATCATAGATG 


CAAAGCAAGA 


TG AT AC TGTT 


CGCCAAAGTG 


AACAGAAACC 


ACAAGTTGGC 


540 


25 


GATTTAAGTA 


AACATATCGA 


TGGTCAAAAT 


TCCCCAGAGA 


AACCGACAGA 


T AAAAAT AC T 


600 




GATAATAAAC 


7A A /Tn A A TV' A X 

AAL i AATCAA 


AGATGCGCTT 


CAAGCGCCTA 


AAACACGTTC 


GACTACAAAT 


660 




GCAGCAGCAG 


A 1 vjC J. AAAAA 


GGTTCGACCA 


CTTAAAGCGA 


ATCAAGTACA 


ACCACTTAAC 


720 




AAATATCCAG 


TTGTTTTTGT 


ACATGGATTT 


TTAGGATTAG 


TAGGCGATAA 


TGCACCTGCT 


780 


30 


TTATATCCAA 


Al I A J. IvAtvtVj 


TV^/"* AAA TI"A A A 

IWaAAAIAAA 


TTTAAAGTTA 


TCGAGGGAAT 


TGAGAAAGCA 


840 




AGGCTATAAT 


*j 1 ALA 1 LAAb 


LAAb 1 Vj 1 AACj 


TGCATTTGGT 


AGTAACTATG 


ATCGCGCTGT 


900 




AGAACTTTAT 


1A1 1ALA1 1A 


a A/^/^ip/^OrTV* A 

AAviCj 1 Ia_A 


CGAGCGTAGA 


TTATGGCGCA 


GCACATGCAG 


960 




CTAAATACGG 


ACATGAGCGC 


TATGGTAAGA 


CTTATAAAGG 


AATCATGCCT 


AATTGGGAAC 


1020 


35 


CTGGTAAAAA 


GGTACATCTT 


GTAGGGCATA 


GTATGGGTGG 


TCAAACAATT 


CGTTTAATGG 


1080 




AAGAGTTTTT 


AAGAAATGGT 


AACAAAGAAG 


AAATTGCCTA 


TCATAAAGCG 


CATGGTGGAG 


1140 




AAATATCACC 


ATTATTCACT 


GGTGGTCATA 


ACAATATGGT 


TGCATCAATC 


ACAACATTAG 


1200 




CAACACCACA 


TAATGGTTCA 


CAAGCAGCTG 


ATAAGTTTGG 


AAATACAGAA 


GCTGTTAGAA 


1260 


40 


AAATCATGTT 


CGCTTTAAAT 


CGATTTATGG 


GTAACAAGTA 


TTCCGAATAT 


CGATTTAGGA 


1320 




TTAACGCAAT 


GGGGCTTTAA 


ACAATTACCA 


AATGAGAGTT 


ACATTGACTA 


TATTAAAACG 


1380 




CGTTAGTAAA 


AGCAAAATTT 


GGACATCAGA 


CGATAATGCT 


GCCTATGATT 


TAACGTTAGA 


1440 




TGGCTCTGCA 


AAATTGAACA 


ACATGACAAG 


TATGAATCCT 


AATATTACGT 


ATACGACTTA 


1500 


45 


TACAGGTGTG 


TCTTCACATA 


CTGGTCCATT 


AGGGCACGAA 


AATCCTGCCG 


AATTAGGCAC 


1560 




GAGACATTTT 


TC TTAATGG A 


TACAACGAGT 


AGAATTATTG 


GTCATGATGC 


AAGAGAAGAA 


1620 




TGGCGTAAAA 


ATGATGGTGT 


CGTACCAGTG 


ATTTCGTCGT 


TACATCCATC 


CAATCAACCA 


1680 




TTTATTAATG 


TTACGAATGA 


TGAACCTGCC 


ACACGCAGAG 


GTATCTGGCA 


AGTTAAACCA 


1740 


SO 


ATCATACAAG 


GATGGGATCA 


TGTCGATTTT 


ATCGGTGTGG 


ACTTCCTGGA 


TTTCAACACC 


1800 




GTAAGGTGCA 


GAACTTGCCA 


ACTTC TATAC 


AGGTATAATA 


AATGACTTGT 


TGCGTGTGGA 


1860 




AGCGNCTGAA 


AGTAAAGGAA 


CACAATTGAA 


AGCAAGTTAA 


ATTCATCTTC 


TGAATTTAAT 


1920 




AGGCTATGTA 


AATCGTGCTG 


TTATCATGGC 


ACATCAGATA 


TAAGTAGCAT 


CACAGTGTTG 


1980 


55 


AATCTCAAAA 


TAGTAAAGTG 


AAATAAAGCG 


CCTGTCTCAT 


TAGCGAAAAC 


TAAAGGGACA 


2040 




GGCGTATCTG 


TTTATGAGCT 


TAATAAATTG 


TATGAATAAT 


ATGGTTGATC 


GAATAACTGT 


2100 
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TTATCATTGA 


TGATAAATTT 


GAGTTTTTTA 


AAAATAATTG 


ATATATTACA 


CCATTGTTAT 


2160 




AGC GTTT AAA 


GAAATCAACC 


CAACTTTACG 


ATAAATAGTG 


ATTGCTTCGT 


CATTAGGTCT 


2220 


5 


ACGATCAAAA 


TCATGCTCGT 


TTTTATTCAC 


GCGTTCAAAT 


GTTGAATGTG 


GAACATGATT 


2280 




CATGATATGT 


TCGCTTTCCT 


CAACGGGAAC 


ATCATAATCG 


CCATTACAAT 


GCGCAATGAA 


2340 




AACAGGTGGA 


AGTGTTTTAA 


GMTCATCTGG 


TGCAATATTA 


TATTTTGAAT 


CAGTATAATC 


2400 




ANCAATGTTA 


ATCATATTTA 


TCCATTTACC 


TGTGCCACGT 


GCATAAACGT 


AGAGTAAAAA 


2460 


10 


ACGTGTGCGA 


TTTGATCTTG 


ANCAACCGGT 


GTTGGTGAAG 


TGAGTTGTCC 


AATCATTGTT 


2520 




TCGTTTATGC 


TTTGAGCTAT 


TTTTGCGTAA 


TACCTATTAG 


TTGTTTTAAA 


AGGGTTCAGT 


2580 




GTTGATGCGA 


CTATAACCAT 


AAAAATCAAT 


AACACCATCA 


ATATCTCTGT 


CTCGTGCAAT 


2640 




TAATAAGACT 


TAAAT ATGC A 


CCTGATGATC 


TGCCAAAGGT 


AAAAATAGGG 


CAATTAGAAT 


2700 


15 


ATTGTGATTG 


AATCGCATCG 


AATGATGCGT 


AGACATCCTC 


AATAATGCAA 


TCGAGACTTA 


2760 




CTTCTGGTAA 


TAAACGATAA 


CTTAGTTGAA 


TTAAATCGTA 


ATGTTCCGTA 


AGGATATCGA 


2820 




TATACTGTGG 


GGATAAATCG 


TTAGCTTTAC 


CGAACATTAA 


TCCACCACCG 


TGGATGTAGA 


2880 




CAATAACGCC 


TTTTGTTGGT 


TGATTTTTTG 


CTTTAATAAT 


TGTGTAAGGT 


AATGCAAATG 


2940 


20 


CATCTTTAGT 


AATTACTTTA 


TATTTAATTT 


CAGTCACGAT 


TTAATAGGCT 


CCTTAGGAAT 


3000 




CCGATATTGA 


TGTCATTATA 


ACACTGTCNT 


NAATTTCCAT 


GN AAAAT AG T 


CTTAAGACGA 


3060 




TGAGTCATGA 


TAATTCTGTT 


CCAATTGACG 


TAAAGCGTCN 


CGGGTATGCT 


TCTTTAGACC 


3120 




TTCCCCATAA 


TCCATCATTT 


TAACAATATC 


TTTAAAAGCA 


GCATGTGGNA 


TGGCTAAATC 


3180 


25 


TTCTAAATCT 


GCCATAGAAA 


ATTCAAGATT 


GATATCATGT 


GGTCGCTGTT 


CAGCAAGTTT 


3240 




ATGCACAAAG 


TCAGGTTCTG 


TGACCAAAGG 


CGAAGACATG 


CCGACCATAT 


CTGCATGTTG 


3300 




TAAAGCATCT 


AAAGCAGACT 


CTGGAGAATT 


AATCCCGCCA 


CTTGCAATTA 


AAGGGATACG 


3360 




AC C TGC T AAA 


TGTTCATAGA 


CAATTTGGTT 


AACTGGTCGA 


CCGAAATGAT 


CACCTGGTGT 


3420 


30 


ACGAGACGTA 


TTTTGATAAA 


TATGTCGACC 


CCAGCTAGCG 


ATTGCTAAGT 


ATTGGATGTT 


3480 




TGAAACGTCC 


ATGACCCAAT 


CGATTAATTG 


GTTGAACTCG 


TCAATGGTAT 


ATCCTAAATC 


3540 




ACTGCCTCTG 


GTTTCTTCTG 


GCGTTGCTCG 


AAATCCTAAA 


ATAAAATTGT 


CAGGTGCTTC 


3600 




TTTATCAATC 


ACTTCTTGTA 


CCGCACGCAT 


AACTTCTAAA 


CATAATCTTG 


CACGATTTTT 


3660 


35 


TAATGAGTCG 


GCACCGTAAT 


GGTCTGTACG 


TCTATTTGAA 


AAAGTTGAGA 


AAAATGTTTG 


3720 




AATCAGCAAA 


CGTTGTGCAA 


TCGAAATTTC 


CACACCATCA 


AAACCTGCTT 


TAATCGCGCG 


3780 




TGCATCGAGC 


TCGTGCC 










3797 



40 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3165 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

55 CAAGCCAGTT CAAC TNACCG AC CCGGT ATA TTTGGGGAAT ATACGAANCN CNNTCAGATG 60 

ATCNAAATAT GANTGANACN CGGTNGCGTT C TCNAGNC AN AACAGNGTTA AATAANCGTA 120 



98 



EP 0 841 394 A2 





AGTCGGCAGT 


TATGTCANAT 


GAGATTACTT 


CTNCTGATTG 


GGNAGTTACG 


NNTCCACACG 


180 




AGANNATATC 


ANNTGGTGAN 


NCAGNCAGAG 


TAAAACATAG 


AGATTTTGAC 


CCGCCATTGT 


240 


5 


ATGTAGAGGC 


AGAAGTTATT 


GCCGAAGAAT 


ATAACATAAT 


TTCAGAAAAT 


AGCACATATA 


300 




CATTCGGTCA 


ACCTAAAGAG 


TTCAAAGAAT 


CAGAATTACG 


AGAAGAGTTT 


AACAAGCGAT 


360 




TGAACATAAT 


ACATCAAAAG 


TTAAACGATA 


ATATTAGCAA 


TATCAACACT 


ATAGTAAAAG 


420 




ATGTTGTAGA 


TAGTGAATTA 


GAATACTTTG 


AACGCAAAAT 


ACACAAAAGT 


GATACACCGC 


480 


10 


CAGAAAATCC 


AGTCAATGAT 


ATGCTTTGGT 


ATGATACAAG 


TAACCCTGAT 


GTTGCTGTCT 


540 




TGCGTAGATA 


TTGGAATGGT 


CGATGGATTG 


AAGAAAC AC C 


AAATGATGTT 


GNAAAATTAG 


600 




GTGGTATAAC 


AAGAGAGGAA 


AGCGCTATTC 


AGTGAATTAA 


ACAATATATT 


TATTAATTTA 


660 




TCTATACAAC 


ACGCTAGTCT 


TTTGTCAAGA 


AGCTACAGAA 


TTACTGGAAT 


AGC G AGT ACT 


720 


15 


TAG TAG AT AA 


TGATTTGAAA 


GCGGACTTAC 


AAGCAAGTTT 


AGACGCTGTG 


ATTGATGTTT 


780 




ATAATCAAAT 


TAAAAATAAT 


TTAGAATCTA 


TG AC AC CCG A 


AACTGCAACG 


ATTGGTCGGT 


840 




TGGTAGATAC 


AAAAACTTTA 


TTTCTTGAGT 


ATAGAAAGAA 


ATTACAAGAT 


GTTTATACAG 


900 




ATGTAGAAGA 


TGTCAAAATC 


GCCATTTCAG 


ATAGATTTAA 


ATTATTACAG 


TCACAATACA 


960 


20 


CTGATGAAAA 


ATATAAAGAA 


GCGTTGGAAA 


TAATAGCAAC 


AAAATTTGGT 


TTAACGGTGA 


1020 




ATGAAGATTT 


GCAGTTAGTC 


GGAGAACCTA 


ATGTTGTTAA 


ATCAGCTATT 


GAAGCAGCTA 


1080 




GAGAATCCAC 


AAAAGAACAA 


TTACGTGACT 


ATGTNAAAAC 


ATCGGACTAT 


NAAACAGACA 


1140 




ANGACGGGAT 


TGTTGAACGT 


NTAGATACTG 


CTGAAGCTGA 


GAGAACGACT 


TTNNAAGGGG 


1200 


25 


AAATCAAAGA 


TAAAGNTACG 


GTTANACGAA 


TATCGAAACG 


GATTGGAAGA 


ACAAAAACAA 


1260 




T ATAC TGATG 


ACCAGTTAAG 


TGATTTGTCC 


AATAATCCTG 


AGATTAAAGC 


AAGTATTGAA 


1320 




CAAGCAAATC 


AAGAAGCGCA 


AGAAGCTTTA 


AAATCATACA 


TTGATGCTCA 


AGATGATCTT 


1380 




AAAGAGAAGG 


AATCCCAAGC 


GTATGC TGAT 


GGTAAAATTT 


CGGAGAGAAG 


AGCAACGCGC 


1440 


30 


TATACAAGAT 


GCTCAAGCTA 


AACTTGNAGA 


GGCAAAACAA 


AACGCAGGAN 


CTTAAAGGCT 


1500 




TAGANACGCT 


GAAAAGANAG 


CTAATGTTTA 


TACAGACAAC 


AAGG TCAAAG 


AAAGCACAGA 


1560 




TGCACAGAGG 


AAAACATTGA 


CTCGCTATGG 


TTCTCAAATT 


ATACACAATG 


GTAAGGAAAT 


1620 




CANATTAAGA 


ACTACTAAAG 


AAGAGTTTAA 


TGCTTCTAAA 


AGAACACTAT 


CAAGAGTGTT 


1680 


35 


AGCAGACATC 


ACTGTAAATG 


CTATGAAAGG 


CATCTATTTA 


AGGTATGACG 


AAAATGGGGC 


1740 




GATTACTTCA 


CATACTATTG 


ATAAAGATGG 


CGTGAAAATT 


AGTGGCGATA 


AAGTTGATAT 


1800 




AACAGCGAAT 


AGAGAATTTA 


ATGTATTCGC 


AAATAATATT 


AATAACAAAG 


TTGGTAAAAA 


1860 




TGACATTGTT 


AATAGCCTAA 


ACTTATCAAA 


TGAAGGTCTT 


GACATCAATG 


TGAATAGAAT 


1920 


40 


TGGTATTAAA 


GGCGGAAATG 


CTAACCGTTA 


TGTACAAGTT 


CAAAATGATT 


TTATTGAACT 


1980 




TGGCGGAATC 


GTACAACGAA 


CTTGGAAAGG 


CAAACGATCA 


ACCGATGATA 


TATTCACACG 


2040 




TCTTAAAGAT 


GGACATCTAA 


GGTTTAGAAA 


TAATACCGCA 


GGCGGGTCAC 


TTTATATGTC 


2100 




ACATTTTGGT 


ATTTCAACAT 


ATATTGATGG 


AGAAGGCGAA 


GACGGAGGTT 


CATCCGGTAC 


2160 


45 


TATTCAATGG 


TGGGATAAAA 


C TT ACAGTG A 


TAGCGGTATG 


AATGGCATAA 


CAATCAATTC 


2220 




TTATGGCGGT 


GTAGTCGCTT 


TAACATCTGA 


CTACAATCGA 


ATTATTATCG 


ATTCATATGC 


2280 




TTCAGCTAAT 


ATTGAAAGTA 


GAGAAGCGCC 


GATATATTTA 


TCTCCGAACA 


CCC AAAAATT 


2340 




AAACCTGGNT 


TTAANCCGAT 


TCGCATTCAC 


ATTATCAAAC 


GCTGATAGGT 


NCATTACGAA 


2400 


SO 


ACTGGCGGTT 


ATATCATGTT 


GGGTTCAAGA 


TGNAANCTAT 


AAGTNCGGTG 


CTGGATTAAG 


2460 




ATTTTCTAAA 


CGTACCAATA 


AAGGATTGGT 


TCAAGTCGTT 


AATGGTGACT 


ATGCTACAGG 


2520 




CGGAGACACT 


ACAATTGAAT 


CAGGTATGGC 


CAAATTCAAC 


TTAGTTAANC 


GGAAGAGATG 


2580 




GAAATAGTTA 


CGTTAGCATT 


CAAAGTTATG 


ATTTATTGGC 


GGTAGGTTCT 


GATAATGCTG 


2640 


55 


GCGATAGAGT 


CGCTTCTAAT 


TCTATTTATA 


AGC GTACTTA 


TTCAGCACCT 


GCTAACTTAC 


2700 




ACATTACTTC 


TGCTGGAACA 


ATTGGGCGTG 


CT AC TTCTGC 


CAAAAAGTAT 


AAAATTTCAA 


2760 
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TCGAAAACCA 


ATACATCAAT 


GAAGACGATC 


AGTTCAGTCA 


TTCAAAAGAG 


ATTTTAAAGC 


2820 




TTCCAATTCG 


TACATGGTTT 


GACAAATATG 


AATCGGAAAT 


AATGGCTAAA 


GAATTGGAAA 


2880 


5 


GTGGTAAAAA 


GTTATCTGAT 


GAT AC TTTTA 


AACTTAGTCG 


ACATACTGGC 


TTAATAGCGG 


2940 




AAGAGGTTGA 


AGAATTAGGA 


TTTAATGAAT 


TTGTTATTTA 


TGATGACAAC 


GGAGAAATCG 


3000 




AAGG TATCGC 


ATACGATAGA 


CTTTGGGTTC 


ATCTANNACC 


TATTATTACC 


CNAAAACCAN 


3060 




CANTCAAANA 


NCGNAAAACN 


TANTGGNGGN 


NTTAACNTCT 


NATNGACNGC 


AACCAGGGTC 


3120 


10 


CNCATNCCAC 


CTCCACATTA 


TACAANTCAC 


TTTCTCNCGT 


CACTA 




3165 



(2) INFORMATION FOR SEQ ID NO: 109: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 





AAAACAAGCG 


ATTATCACAG 


CAATTATTTT 


AATAGCACTG 


AGCTTGATTT 


TACCTGCATT 


60 




TGCAGTAGGT 


AATCACATTT 


CAATCTTCGT 


CATTTCTATG 


ATATTCTTTG 


CAGGTAGCTT 


120 


30 


TGCTTTAATG 


TATGCACCTT 


TACTTAACGA 


AGCC ATT AAA 


ACAATAGATC 


TTAATATGAC 


180 




AGGTGTGGCT 


ATTGGTTTTT 


ATAATTTAAT 


TATTAATGTG 


GCGGTATCTG 


TAGGTATTGC 


240 




GATTGCTGCG 


GCTCTAATCG 


ATTTTAAAGC 


ATTAAATTTC 


CCAGGCAATG 


ATGCATTAAG 


300 




TTCACATTTC 


GGTATTATTT 


TAATTATTTT 


AGGTTTAATG 


AGTATTGGTC 


CGGATTAGTT 


360 


35 


TTATTCCGNC 


ATC TTTAAAT 


CCGGTGGGAC 


AACAATCTTG 


AAAAATAAAT 


AGATATTAAA 


420 




TCGCGAGATA 


TATCCGTATT 


TTATTGGTAA 


AATTAAATTA 


AAGAGATTAT 


ATTACACGAG 


480 




GAGTAGTAAG 


TATTGAAATT 


GGAGAAATAT 


ATAGATCACA 


CTTTATTGAA 


GCCTGAGTCA 


540 




ACACGTACGC 


AAATCGATCA 


AATCATCGAT 


GAAGCGAAAG 


CATACCATTT 


TAAATCTGTA 


600 


40 


TGTGTGAATC 


CAACGCATGT 


TAAATATGCA 


GCAGAGCGAC 


TAGCTGATTC 


AGAGGTGCTC 


660 




GTTTGTACGG 


TAATAGGATT 


CCCATTAGGT 


GCGTCGACAA 


CTGCAACGAA 


AGCATTTGAA 


720 




ACAGAAGATG 


CAATTCAAAA 


TGGTGCAGAT 


GAAATTGACA 


TGGTCATCAA 


CATCGGCGCA 


780 




TTAAAAGATG 


GACGTTTTGA 


TGATGTACAA 


CAAGACATTG 


AAGC AGTGG T 


TAAAGCTGCG 


840 


45 


AAAGGTCACA 


CAGTAAAAGT 


GATTATTGAG 


ACGGTATTGT 


TGGAACCATG 


ACGAAATTGT 


900 




AAAAGCGAGT 


GAATTAACAA 


AAGCGGCTGG 


TGCGGACTTC 


GTTAAAACTT 


CAACAGGTTT 


960 




TGCAGGTGGC 


GGTGCGACTG 


CAGAAGACGT 


TAAATTAATG 


AAAGATACAA 


GTAGGTGCTG 


1020 




ATGTAGAAGT 


AAAAGCATCA 


GGTGGCGTAC 


G TAATTT AG A 


AGGATTTCAA 


TAAAATGGTT 


1080 


50 


GAAGCAGGTG 


CGACACGTAT 


TGGGCGCGAG 


CGCAGGCGTT 


CCAAATTATG 


CAAGGTTTAG 


1140 




AAGCAGATTC 


AAGATTACTA 


ATATATATAA 


AATTTGGGAG 


TGATAGCTAT 


GACAAAGACC 


1200 




ATTTAATCGN 


GTTCATTTTA 


TCCGTATGG 








1229 



55 (2) INFORMATION FOR SEQ ID NO: 110: 



100 



EP 0 841 394 A2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1066 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



15 


GGCACGAGAC 


GTAAATGNAA 


AGAAAACCGG 


CCTNCNTGAT 


TAAGANATTT 


AGAAAAGTGT 


60 




TTGTCATNAC 


GGTTGTCCCA 


GTGNTNAGTA 


TGTCCAAATA 


NAGGAAGTGA 


NAACGATGGN 


120 




ACCGTATTTT 


TAAAANAGGC 


AGNACATANC 


ATTTTAGATA 


GG AAGTGATA 


ATATGAATTT 


180 




GGCTAAACGC 


ANATTACAAG 


GGGAACNATT 


AACAANAGAG 


ACTGTAGTGA 


AAATTTATGA 


240 


20 


GGATACTAAT 


ATTGATACCT 


TAGNTTTATT 


AAATGAGGCG 


NACATCTTTA 


AGNAN AC ATT 


300 




ATTTTGGTGC 


NATNAGTCAA 


ATTAAACATG 


ATTTTAAATG 


CTAAAAGTGG 


NATATGTCCT 


360 




GAGAATTGTG 


GGCNCGAGAC 


TGTGGACAAT 


CACGAGATAT 


TAAACAANAA 


CAGCGATATG 


420 




CTTTAATTCC 


AGAGGAACAA 


ATTATCGATN 


GANCANAGGT 


GGCACATGAT 


AATCATATTG 


480 


25 


GAACATATTG 


TATTGTTATG 


AGTGGTAGAG 


GACCGAGCGA 


TAAAGAAGTT 


GATCATATTA 


540 




GTAATCCTGT 


AAGAACGATT 


AAATCTCAAC 


ACCCGCAACT 


AAAAATCTGT 


GCATGTTTAG 


600 




GATTAACGGA 


TTGCCGACCA 


AGCTAAGAAA 


CTTAAGTCAG 


CTGTTGTAGA 


CAGATATANC 


660 




CACAATATTA 


ATACCAAGTG 


AAAATTACCC 


ATTG AT AAC C 


GTCGTGCCAA 


CGCATAGTTA 


720 


30 


TNAAGATAGA 


ACAGATACGA 


TAGAACTAAT 


GAAAGCGAAT 


AATATATCAC 


CATGTTCTGG 


780 




CGTGATTTGT 


GGTATGGGAG 


AATCTAATCA 


AGATATTGTT 


GATATGGCAT 


TTGCTTTAAA 


840 




AGAAATGGAT 


GCCGACAGTA 


TTCCGATTAA 


TTTTTTGCAT 


CCAATCAAAG 


GCACAAAGTT 


900 




TGGAAGCATG 


GATGATTTAA 


CACCAATGAA 


ATGTTTAAGA 


ATCGTAGCAT 


TATTCCGATT 


960 


35 


AATCAATCCT 


ACGAAAGAAA 


TTCGTATTGC 


TGGAGGAAGA 


GAGGTCAACT 


TACGTTCGTT 


1020 




ACAGCCATTA 


GCATTAAAAG 


CGGCGAANTN 


CAATATTTGG 


GCGGGG 




1066 



(2) INFORMATION FOR SEQ ID NO: 111: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3222 base pairs 

(B) TYPE: nucleic acid 
45 (c) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CCAAGTAAGT ATAGAAGCAG CATCCTTAGA AAAGAAAAAT GTAGATTTGA ACGAATTATT 60 
55 ACAGCGTCTC AATGACGTTG AACAAACGGA ATACACCAGG TTCGCTTNAC CCTAAATTTT 120 

TAAATGTTTC GTCACTTATT CAATATATTC AAGCAGCATA TCATGAACCT CTTAGAGAAG 180 
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AATTTAAAAA 


TTTAAC AC CT 


TATGTGACGA 


AATTATCGAA 


AGTACATGGA 


CCTAACCATC 


240 




CATATTTAGT 


CGAGTTAAAA 


GAAACATATG 


ATACATTTAA 


AAGTGGCATG 


TTAGAGCATA 


300 


5 


TCCAAAAAGA 


AGATGATGTT 


GATTTTCAAA 


ACTAATTAAA 


TATGAACAAG 


GTGAAGTAGT 


360 




AAACGATATT 


AATACAGTGA 


TTGATGATTT 


AGTATCTGAT 


CACATTGCAA 


CGGGACAATT 


420 




GTTAGTGAAA 


ATGAGCGATT 


TAACATCTAG 


CTATGAACCA 


CC GAT AG AGG 


CATGTGGTAC 


480 




GTGGCGACTC 


GTTTATCAGA 


GATTAAAAGC 


AC TTGAAGTG 


TTAACACATG 


AGCATGTTCA 


540 


10 


TTTAGAGAAT 


CATGTTTTAT 


TTAAAAAAGT 


ATCATAAATA 


ACGCGATTAG 


AAACTGTTGG 


600 




CAAAGTAATG 


TCAGCAGTTT 


TTC GCT AT AC 


TTAACAGAAA 


TTTAGTGATA 


TGAACAGCAT 


660 




TATTTGAAAA 


GAAAAATGGT 


CAACTTAGCA 


TAAAAATTGA 


TATGAAAATT 


TAATGGTATA 


720 




GATAATTAAA 


TAG T AGCGTG 


TTTTTTTAAT 


AATTTATTCA 


TGAATTTTAC 


ATGCACTATT 


780 


15 


ATGATAAAAT 


AAACATAATT 


ATAATTCACT 


GAGGTGCTAT 


CGTGCTATCG 


CT AAC AATGT 


840 




TATTACTTGA 


GCGTGTAGGT 


TTAATTATTA 


TTNTGGGCCT 


ATGTGTTGAT 


GAATTATTCC 


900 




ATATTTTTAA 


AAACTTTAAT 


GAATCCGTCG 


ACGCTACATG 


GAAAGCACGT 


TGGCAATTAT 


960 




GTATTATTTT 


CAGTTTGTTT 


GCCTT AATGT 


TCTAATTTAA 


CTGGTATCGT 


CATCGATCAT 


1020 


20 


CAACATAGTT 


TGTCAGGAAG 


TGTGTACTTC 


CGTTTAGATG 


ATGATGTATC 


TTTAGCTAAC 


1080 




ACACGTGTAT 


TAACGATAGG 


TGTCGCAGGA 


TTAGTTGGTG 


GCCCTTTTGT 


AGGTCTATTT 


1140 




GTTGGCGTTA 


TTTCAGGTAT 


TTTCAGAGTG 


TATATGGGTG 


GGGCGGATGC 


ACAAGTTTAT 


1200 




CTTATC TCAT 


CTATATTTAT 


CGGNATTAAT 


TGCTGGTTAT 


TTNGGGCTTA 


CAAAGCTCAA 


1260 


25 


AGGACGCAAA 


GCGGTTACCC 


GAGGTATTTG 


GCGAAAAAGT 


GGCCAATGAA 


TTGGGANTTG 


1320 




GTTATTGGGA 


AATGAATCAA 


ATGTNGGAGC 


AATTTTAACA 


TTTTTCCCCA 


CGACAAAGCA 


1380 




TATTGCGGTT 


GACTCATATC 


ATTAATTGCA 


CTACCAATGA 


TTATTGTTAA 


TAGCGTTGGT 


1440 




AC GGCGATTT 


TTATGTCTAT 


TATCATTTCC 


AACATTAAAA 


GCAAGAAGAG 


CAAATGAAGC 


1500 


30 


CTGTTCCAAA 


CACATGACGT 


AC TGC AATTG 


ATGAACCAGA 


CATTGCCGTA 


TTTTAAAGAA 


1560 




GGATTGAATA 


GAGAATCGGC 


ACAGCAAATT 


GCGATGATTA 


TTAAAAATTT 


AATGAAAGTA 


1620 




TCTGCCGTAG 


CAATTACAAG 


CAAAAATGAA 


ATC TTATCGC 


ATGTAGG TGC 


AGGTAGTGAT 


1680 




CATCACATAC 


CAACAAATGA 


AATATTAACA 


AGTCTGTCTA 


AAGATGTATT 


GAAATCAGGA 


1740 


35 


AAGTTGAAAG 


AAGTGCATAC 


TAAAGAAGAG 


ATTGGTTGTA 


GTCATCCGAA 


TTGCCCGCTT 


1800 




AGAGCAGCTA 


TCGTGACACC 


ACTTGAGATG 


CATGGTTCTA 


TCGTCGGTAC 


ATTGAAGATG 


1860 




TATTTTACAA 


ACCCTAATGA 


TTTAACTTTT 


GTGGAACGTC 


AACTTGCAGA 


AGGATTGGCA 


1920 




AATATTTTTA 


GTAGCCAAAT 


TGAACTTGGT 


GAAGC CG AAA 


CGCAAAGTAA 


GTTATTGAAA 


1980 


40 


GATGCTGAGA 


TTAAGTCATT 


ACAGGCACAA 


GTGAGTCCAC 


ATTTTTTCTT 


CAATTCAATT 


2040 




AACACGATTT 


CAGCTTTAGT 


TAGAATAAAT 


AGCGAAAAGG 


CACGAG AG TT 


ACTATTAGAA 


2100 




TTGAGTTATT 


TTTTCAGAGC 


GAATTTACAA 


GGCTCAAAGC 


AACATACGGA 


TTACTTTAGA 


2160 




TAAAGAGTTA 


AGTCAAGTGC 


GTGCATACTT 


ATCACTCGAA 


CAAGCACNGT 


TATCCAGGAA 


2220 


45 


GATTTAATAT 


CAATATTAAT 


GTTGAAGACA 


AATATCGCGA 


TGTGCTTGTA 


CCACCATTTT 


2280 




TAATTCAAAT 


TTTAGTTGAA 


AATGCCATCA 


AACATGCGTT 


TACGAATCGA 


AAGCAAGGTA 


2340 




ACGATATTGA 


CGTGTCAGTG 


ATTAAAGAAA 


CTGCAACACA 


TGTACGTATT 


ATTGTACAAG 


2400 




ATAATGGTCA 


GGGTATTTCT 


AAAGATAAAA 


TGCATTTGTT 


GGGAGAAACA 


TCTGTAGAAT 


2460 


50 


CAGAATCTGG 


AACTGGTAGT 


GCTTTAGAAA 


ATTTAAACTT 


ACGC CTAAAN 


GGATTATTTG 


2520 




GAAAATCCGC 


AGCATTACCA 


ATGTGAATCC 


GACATCGAGN 


GGTACCACTT 


TTTGGTGTGT 


2580 




ACCTTCCTTA 


TGAAAGACAA 


GAGGAGGAAT 


AAATATGAAA 


GCATTAATCA 


TAGATGATGA 


2640 




GCCATTAGCA 


CGCTAATGAA 


TTAACATATT 


TATTAAATGA 


AATTGGTGGT 


TTTGAAGAAA 


2700 


55 


TTAATGAGGC 


AGAAAATGTA 


AAAGAAACAT 


TGGAAGCACC 


TACTTGATCA 


ATCAATATGA 


2760 




CATTATATTT 


TTAGATGTCA 


ATTTAATGGA 


TGAAAATGGG 


ATCGAATTAG 


GAGCTAAGAT 


2820 
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. TCAAAAGATG AAAGAGCCAC 


CTGCGATTAT 


TTTTGCAACT 


GCACATGACC 


AATACGCAGT 


2880 




ACAGGCATTT 


GAATTAAATG 


CGACAG AC T A 


TATTTTGAAA 


CCGTTTGGTC 


AAAAACGTAT 


2940 


5 


TGAACAAGCA 


GTCAATAAAG 


TGCGTGCGAC 


TAAAGCCAAA 


GATGATAATA 


ACGCAAGTGC 


3000 




AATTGCGAAT 


GATATGTCGG 


CGAATTTTGA 


TCAAAGCTTA 


CCTGTTGAAA 


TTGACGATAA 


3060 




AATTCACATG 


TTAAAGCAAC 


ANAATATTAT 


TGGGATTGGC 


ACACATAATG 


GTATTACAAC 


3120 




CATACATACA 


ACGAATCATA 


AATACGAAAC 


AACAGAGCCA 


TTGAATCGTT 


ATGAAAAACG 


3180 


10 


ATTGAATCCC 


ACTTATTTTA 


TACGTATTCA 


TCGTTCATAT 


AG 




3222 




(2) INFORMATION FOR SEQ 


ID NO: 112 : 








15 


(i) SEQUENCE CHARACTERISTICS : 










(A) 


LENGTH: 805 base pairs 










{B> 


TYPE: nucleic acid 












(C) 


STRANDEDNESS: single 










20 


(D) 


TOPOLOGY : 


linear 












(ii) 


MOLECULE TYPE: Genomic DNA 








25 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 112: 








ATCAATGTAA 


CCTAATAGTT 


TATGTCTATC 


TTGTGTACCA 


ACTACTACAT 


CGACACCAGG 


60 




AATTTCCATA 


ATTTCAGCTG 


ATGAAGTTTG 


CGCATAACAA 


CCTGTTACAC 


AGATTACAGC 


120 


30 


ATCAGGATTG 


TGTCTTATTG 


CACGTCTAAT 


TATTTGACGA 


CTTTTTTTAT 


CACCTGTATT 


180 




CGTTACTGTA 


CAAGTATTAA 


TAACAAATAC 


ATCAGCATTC 


GCTTCAAAGT 


CAACGCGCTC 


240 




ATAGTTTGCT 


TCTTTAAATA 


ATTGCCAGAT 


TGCTTCAGTT 


TCATAATGGT 


TTACTTTACA 


300 




ACCTAATGTG 


TGAAACGCAA 


CTGTTGACAT 


AAATATTCAC 


CCCATTAATT 


C TTTTTC AT A 


360 


35 


ACTTATTGCA 


CTTAACGCAT 


ACAATGGCGC 


AGTTTCTGCC 


CGTAAAATTC 


TCGGTCCAAG 


420 




ACCAACAACT 


GTACTAGTAT 


TACTAAATAA 


TGAAATTTCA 


ATTTCTGACA 


AACCACCCTC 


480 




AGGACCAAAA 


ATCATCAACA 


CTTTATCCTG 


AGCCTTGAAT 


TGTTGTAAAG 


TNTGCTTGAA 


540 




ATTGCTTAAC 


TC AC CATC TT 


TTGCTTCCTC 


TTCATATGCA 


ATAAGAATAT 


AGTCNTAATT 


600 


40 


ATCAATAGTA 


TCACAAATTA 


ATTTTAAATT 


CCGACTCCGA 


ATTGAATAGA 


TGGAATCCAC 


660 




TAAACGAATA 


GCTTTGTTCA 


GCAGCTTCTT 


TAATTATTTT 


TTGCCAACGC 


TCTATC TTTT 


720 




TGACAACTTT 


CGCCTCCGTT 


TAATTTAACA 


ATTGAGCGTT 


CCATGCTCAC 


AGCTATAAAT 


780 




GATGAAGCAC 


CCAATTCATA 


GCTTT 








805 



45 

(2) INFORMATION FOR SEQ ID NO: 113: 



(i) SEQUENCE CHARACTERISTICS: 

so (A) LENGTH : 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

5 





GGCACGAGCG 


TAGATTTAGC 


ATTTTCAAGA 


GACCAAGAAC 


ACAAAGAAAT 


ATGTACAGCA 


60 




TCGTATAATG 


GAAGAAAGTG 


AACGTTTCAA 


TGAATGGATT 


GAGCAAGGCG 


CAGCAATCTA 


120 




TATTTGTGGC 


GATGAAAAAT 


GTATGGCGAA AGATGTCCAT CAAGCCATTA AAGATGTATT 


180 


10 


GGTAAAAGAA 


CGTCATATTT 


CTCAAGAAGA 


AGCAGAGTTA 


TTATTGCGAC 


AAATGAAACA 


240 




ACAACAACGC 


TATCAACGTG 


ATGTTTATTA 


GCGATTGGTG 


TTAAATATTT 


TAAGGTGTAA 


300 




TGATGTAAAA 


AG AT AT AAAG 


G ATGTTGC TC 


AACATGAATA 


TGCCATTAAT 


GATAGATTTA 


360 




ACAAATAAAA 


ATGTCGTCAT 


AGTTGGNTGG 


AGGCGTCGTT 


GCAAGTCGTC 


CGGGCACCAA 


420 


15 


ACATTAAATC 


AATACGTTGA 


ACATATGGAC 


GGTCATCAGT 


CCGACAATCA 


CTGAAAAACT 


480 




TCAAAATATG 


GTAGATAACG 


GTGTCGTCAA 


TATGGAAAGA 


AAAAGAATTT 


GAACCAAGCG 


540 




ATATTGTAGA 


CGCGTATCTA 


GTTATTGCAG 


CAACCAATGA 


GCCACGTGTC 


AATGAAGCGG 


600 




TAAAACAAGC 


CTTACCTGAG 


CATGCCCTTT 


TTAATAATGT 


TGGAGATGCA 


TCAAATGCCA 


660 


20 


ATGTTGTATT 


TCCAAGTGCA 


CTACACCGCG 


ACAAGCTAAC 


TATCAGTGTA 


TCAACTGATG 


720 




GTGCGAGTCC 


TAAGTTGACA 


AAATCAATTA 


TGGCAGAGCT 


TGAGGCGTTA 


TATCCACCAT 


780 




CATACAGTTC 


GTATATCGAC 


TTTTTATATA 


CTTGCCGACA 


GAAAATAAAA 


GTACTTGATA 


840 




TAACATATAT 


ACGAAAAGCA 


ACAGTTACTG 


TCACAAATTG 


TGTCACAAGA 


ATATTTAAAT 


900 


25 


CATGACAAAC 


AAGCTCAATT 


TTTAGCGTGG 


TTGGATGTAA 


GATAATAATA 


GCGGACCGTC 


960 




TAACCGTCTA 


AGGTAAGTCT 


TCTTATTTTA 


ACTTTAACGC 


TTAATCATTG 


AAATTAAGAC 


1020 




ATGGGTGGCT 


TTGTGAATAG 


TCTAATAATG 


AAGGATTTAA 


GCGATAATGA 


TATGCGTTTT 


1080 




AAATATGAAT 


ATTACAATAG 


AGAAAAAGAT 


ACGTAGAACA 


AACTTAATAA 


AATAGATGGA 


1140 


30 


TAAATTGAAA 


TCTGGTTGAA 


GTCGTTACTA 


TCATAGCGAC 


CTTTAGCCAG 


ATTTTTTGTG 


1200 




CAATAGAAAG 


CAATAATAAA 


AATGATAGAT 


CAANATGAAA 


TACAGGACAG 


GATATACAAG 


1260 




GATTAGTCAT 


GCCATGTTAT 


CAAGTAGGAA 


AATCAAACTT 


CACTATTGAT 


AGTTACGCAN 


1320 




AAAGATTTTT 


TTGATAAAAT 


GAGATAACTT 


AAAGATAAAA 


AATTATATTA 


ATTATAATAT 


1380 


35 


TTAAGTTAAA 
ATAGTGAG 


GAGGGGGATT 


ATGTAAATTG 


TATTAAAAGT 


GGAGGGAGAA 


AATAATATGA 


1440 
1448 



(2) INFORMATION FOR SEQ ID NO: 114: 

40 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1127 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

CGACATGGAA GAACNTANAC CCTCGTNNAG GAGGCCCAGA AAGGNAGC NT NAAGNNAAAT 60 
55 TAAGNNAAGC GAAAAGCACC AACAATAAAA CCTGATGGCA GCATCATTCA ATGCGTGCCA 12 0 

CCAGGTTTTT ATGTTTTGTC AAGAAAATTA AATAAATCAT TAAATGATTC GGCCCATCGT 180 
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AGGATGTGAA 


TAAATACTAT 


CTCGTAATAC 


GGTATATGGA 


ATGTTTTGAT 


CAATCGCAAG 


240 




TT TAATT AT A 


TTAATTAATT 


CTTCAGATTG 


CTTACCATAT 


AATGTAGCAC 


CT AAAATC AT 


300 


5 


ATTATTTTCA 


TTATTAATGA 


CTACTTTAAA 


TAAACCTCTT 


GGATCATTGT 


TAATTTTGTG 


360 




ACGAGG TATA 


GC AC TTACT A 


AAAGTTGATG 


TTCAGTGTAA 


TCATAATGTT 


GAGCGGCAGC 


420 




TTC TTT AC T A 


GTTAATCCAA 


CACGTGATAA 


TGGTGGATCT 


ATAAATACTG 


TATAAGGC AC 


480 




1 AV_\_ A V_ 1A 


1 lol v_-rtvjj 






AACGCTGATT 


TGATAATTrfi 


540 


10 


ATAATCATCT 


AAAGATATAT 


ACGTAAATTG 


AAGTCCGCCT 


TTAACATCAC 


CTGCAGCATA 


600 




AATATGTGGC 


ACAGATGTTT 


GAAGTTGAGC 


ATTGACTTTA 


ATTTCGCCTC 


TATCACCTAA 


660 




TTCGATATCA 


GTATTTTCTA 


AAGCTAAGGC 


TGTATTCGGT 


TTGCGCCCGA 


TAGCCAAAAG 


720 




TACTGCATCA 


GCCTCANAGT 


TGCCAACATT 


AGNACGGACT 


GTTGTATGAT 


GATCGTCAGA 


780 


15 


TGACAATTCA 


GTCGTTTCAA 


CATTTGTATG 


CAATGCAATG 


CCTTTATTTT 


CTAAGTCAGT 


840 




AATAGCATGT 


GCANCGACAT 


CTTGATCTTC 


GCCGTGGCAT 


AAATGATTCA 


CCACGTTCTA 


900 




ATACNGTTAC 


CTTACTACCT 


AAATTCGCAA 


ACATTGAAGC 


AAATTCTAAG 


GCAATATAAC 


960 




CGCCACCTAC 


AATAACGAGA 


TGCTTAGGTT 


GATAGCTAAT 


GTTTAATAAA 


CCTGTTGAAT 


1020 


20 


CGAAGACATG 


TTTAGCTTGA 


TCAAGGCCTT 


TTATGTTAGG 


AATGATAGAG 


GTAGC AC CGG 


1080 




TATTAATAAT 


GATATGAGGT 


GCAGTAATAC 


TAGCGACGAT 


ATCGCCG 




1127 



(2) INFORMATION FOR SEQ ID NO: 115: 

25 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 





GAAGANATAN 


CGTGGTNTTG 


AAGATTTGGG 


ANGTTAATAG 


ATNAAGGGGA 


ATATGCANTT 


60 


40 


ATTAC CGGGA 


GNNNAGTATC 


CAAGNAAATC 


TAGAGTCAAT 


AGGTTAATAA 


TCTTATGCCT 


120 




TTTTAATTTT 


GAATAAAAGT 


GGGGTGGTGT 


AATGTTTGGA 


TTTACCCANC 


CGACCCGAAC 


180 




AAGATTGNCG 


TTTNACGCGA 


TTAGAAGNAA 


ATGATAAGCC 


TATGTTTGNN 


AAATTCGACA 


240 




GAATAGAAGA 


CAGTCTGAGA 


ACGCAAGAAA 


AAATTTATGA 


CAAGTTAGAT 


AGAAATTTCG 


300 


45 


AAGAACTAAG 


GCACGAGCGA 


CAAGGTAGAA 


GATGAAAAGA 


ATAAAGAAAA 


GAATGCCAAA 


.360 




AATATTAGAG 


ACATAAAAAT 


GTGGATTCTA 


GGATTAATAG 


GGACGATCTT 


CTAAGTACGA 


420 




TTTGTCATAG 


CCTTAC TAAG 


AACTATTTTT 


GGTATTTAAA 


GGAGGTGATT 


ACCATGCTTA 


480 




AAGGGATTTT 


AGGATATAGC 


TTCTGGGCGT 


GCTTCTGGTT 


TGGTAAATGT 


AAATAACAGT 


540 


50 


TAAGAGTCAG 


TGCTTCGGCA 


CTGGCTTTTT 


ATTTTGATTG 


AAATGAGGTG 


CATACATGGG 


600 




ATTACCTAAT 


CCGAAAAATA 


GAAAGCCCAC 


AGCTAGTGAA 


GTGGTTGAAT 


GGGCGTTATA 


660 




TATCGCTAAA 


AACAAAATAG 


CTATTGATGT 


ACCTGGTTCT 


GGAATGGGAG 


CACAATGCTG 


720 




GGAATTTACC 


TAATTATTTA 


CTCCGATAAA 


TATTGGGGAT 


TTAGAACATG 


GGGAAATGCT 


780 


55 


GATGCTATGG 


CTCAGAAATC 


CAATTAATAG 


AGGTAGAGAT 


TTCAAGATAA 


TTAGAATCAA 


840 




AAGACTTGGT 


ACACAACCAG 


GCGACTGGGT 


TTTTGG 






876 
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(2} INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: Genomic DNA 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



20 



25 



30 



35 



40 



45 



50 



GGCACGAGCG 


GCACGAGCGT 


GTTGTATCAA 


GATTTTGTAG 


GCAGTTTTAC 


AACGTCCGAT 


60 


TCAGCAAGTT 


ATGCACAAGA 


TTTTAAATCT 


GAGGAAAACG 


CTAAAAAGAT 


TGCTGAAACT 


120 


TTAAATCTTT 


TATATCAATT 


AACAGGCAAT 


CAAAACGGTG 


TGAAAGTTGT 


GAAAGAAGTT 


180 


GTGGATAGAA 


CTGACTTGTC 


ATCTGATAAA 


TCAGTTGATA 


GCGAAACAAT 


GTAACTATAC 


240 


TAAGTTATGA 


GCATTACGCT 


CATAGCTTTC 


TTAGAAAGTA 


GGTGTAGTTT 


TGGATGATAT 


300 


TCAGAAAATA 


AAAAAAGAGC 


TTTC TGAATT 


AGTTGAACGT 


GTTGATGATG 


TTGAAATACT 


360 


AGCAAACGAA 


ACAGCTGATC 


ATGTGCTTGA 


ACTTAGAGAG 


GAACATAAGC 


AACATCATAA 


420 


TGAACTAAGA 


GAATCTCATA 


AAGAACTTAA 


AGATAAGCAA 


GATAAAGTTG 


TAGATGAGAA 


480 


TTTAGAGCAA 


ACAAAGATAT 


TAAACAGAAT 


TGAAGAAAGA 


TATCANACGC 


AAGTAGNTGT 


540 


TGNGCAAAAA 


AATGAAGAAA 


AGACACTCGC 


CCAAAATAAA 


TGGCTCGTAG 


GTGCCATATG 


600 


GGCGCTTGTA 


ACAATTGTTA 


TGATTGCAGT 


CATTACTGCA 


TCAATTNCTG 


CGTTATTACC 


660 


TTAAGGGAGG 


TGGACATAAT 


GAGTTGGGCA 


AGATGGTTAT 


CATGTTATTT 


GTNTGGTCGT 


720 


AAATGTAAAT 


AATGTTTTTG 


GTCAGTGCAT 


CGGCACTGGC 


TTTTTATTTT 


GATTGAAAAG 


780 


AGGTACGTAC 


ATGGTATTAC 


ACAGCTCACA 


AGACAGGAAG 


CATACTCCAA 


GTGAAGTTGG 


840 


GAAGTGTTGT 


TAATACCAAG 


TAAGTAGGAT 


ATCTGANATG 


TATAATAGAG 


TAAAAATGAA 


900 


ATCTTTTTAT 


TATAGACACA 


TATAAAAAGT 


GTATAGTAAT 


ATATGTATGT 


ATAATTAAAT 


960 


GATAATCATT 


TCATAATTAT 


TGTATATAAC 


TAAATAACTA 


C TTAAC ANAA 


ATAATTATGC 


1020 


TTTAGAGNTG 


ACCANNATGA 


NNNANNCCAG 


CATTTACATT 


ACTTTTATTC 


ATTGCCCTNA 


1080 


CGTTGACNAC 


AAGTCCCANT 


TGTAAATGGT 


AGCGAGAAAA 


GCGNAGNAAT 


AAATGCGAAA 


1140 


GATTTGCGAA 


AAAAGTC TGA 


ATTCCAGGGN 


ACAGCTTTAG 


NCAATCTTAN 


NCANATCTAT 


1200 


TATTACNATG 


NNANAGCTAN 


AACTGAAAAT 


AAAGAGAGTC 


CNCGACCACA 


TTTTTACAGC 


1260 


ATACTATATT 


GTTTANAGGC 


TTTTTTACAG 


ATCATTCGTG 


GTATANCGAT 


TTATTAGTAG 


1320 


ATTNTGATTC 


NNAGGATATT 


GTTNATAAAA 


ATAAAGGGNA 


AANAGTAGAC 


TTGTATGGTG 


1380 


CTTATTATGG 


TTATCAATGT 


GCGGGTGGTA 


CACCACACAA 


AACAGCTTGT 


ATGTATGGTG 


1440 


GTGTAACGTT 


ACATGATAAT 


AATCGATTGA 


CCGAAGAGAA 


AAAAGTGCCG 


ATCAATTTAT 


1500 


GGCTAGACGG 


TAAACANAAT 


ACAGTACCTT 


TGGAAACGGT 


TAAAACGAAT 


AAGAAAAATG 


1560 


TAACTGTTCA 


GGAGTTGGAT 


CTTCAAGCAA 


GACGTTATTT 


ACAGGAAAAA 


TATAATTTAT 


1620 


ATAACTCTGA 


TGTTTTTGAT 


GGGAAGGTTC 


AGAGGGGATT 


AATCGTGTTT 


CATACTTCTA 


1680 


CAGAACCTTC 


GGTTAATTAC 


GATTAATTTG 


GTGCTCAAGG 


ACAGTATTCA 


NATACACTAT 


1740 


TAAGAATNTA 


TAGAGATAAT 


AAAACGATTA 


ACTCTGAAAA 


CNTGCGTAG 




1789 
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(2) INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 

5 

(A) LENGTH: 1410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

15 



20 



25 



30 



35 



40 



45 



CGATCTACAT 


TCGCTGCGCC 


AATACACACA 


ATAGGATAAT 


CTTCATTTAA 


AAC AT ATGC C 


60 


TTTCCCATAA 


CATATTCCTT 


TTGTATTAAT 


CCTGAAATAA 


TGTTTGC T AC 


GCTGGGTCTA 


120 


GATAATCCAA 


TTGCCTCAGC 


AAGTTCACGT 


TGTGAAATAA 


NCGGATTATC 


TTTAATTCTT 


180 


TTTAAAATTT 


CTTTCTCAGN 


ATCGCTCATA 


ACCATCCCTC 


CTCATATTGA 


C TT AAAAT AC 


240 


TTGAATCAGT 


GAATTCACCA 


AAAATATCGT 


AGCATGCTGT 


TATTTCTGTT 


GTAAACGCTT 


300 


ACATTTTNCC 


GTTATATCAA 


TGATATTTGT 


TTATAAAATA 


CGCATAATTT 


CGTAAAAAGT 


360 


AAGTTCATTG 


CCATCGTGTA 


AAAGNTACTC 


GAGCAAATGA 


TAAATGTTTA 


TACAGTATGA 


420 


TATTTGTCGA 


CAATAAATTA 


TGACATTTTT 


ATGTC TT AT A 


TTTTTCTATA 


AAAAAACGTG 


480 


ACTCATTATC 


GATTTTTTCA 


TCGACTGAAT 


CACGTTACGA 


TGTTTTATTT 


GTTTTTNCTA 


540 


TTCAATATGA 


TGATGCTATG 


CGTCCAACTG 


CACCTCGATT 


AATCATGTCT 


TGCTTAGTAA 


600 


CAATTGGATA 


TTTTTTAAAT 


ACGGATTGAA 


CCCAAGATAA 


CCCTGCAAGC 


GCACCCTACA 


660 


ATCGCCATTA 


CTACACCATA 


AATTACAATA 


GTCATTGGAT 


GATTAAATCC 


ANACATTACT 


720 


AAAAATCCTN 


CAATCGGTGT 


AGCTGTACCT 


GTCGCATCGT 


TAATTAAACC 


TGACATAGCA 


780 


ATAATTAAAC 


CAGCAATCGC 


ACCACCAAAG 


AAATTTGTAA 


TATAGATTGG 


AATTGGATTG 


840 


GCTGATACAA 


TATCTGCTTG 


TGATAAAGGT 


TCAATACTTA 


CTGCAATCGT 


AGACTTACGA 


900 


TCACCTAATT 


TTAAGCGATG 


GAATAGCGTC 


CCATTCATAA 


ATGC C GAACT 


AAATGCTGCC 


960 


ATGGCACCAA 


TAGCCATAGG 


TACACCCGTT 


AAACCTAATA 


ATGCTGTCAA 


TGCCATTGAA 


1020 


CTCAATGGCG 


CTGTGCCGAC 


AACCGTGATA 


ATACCACCTA 


AAATG AT AC C 


CATGATAATT 


1080 


GGATTCGTAT 


TCGTACTACT 


TTGGATAATA 


TCACCAATTC 


GAATCAATGT 


ACTATTCACT 


1140 


ACTGGCGTTA 


ATAATACTGC 


AATAAGACGA 


GCTAATGGCG 


CTAAGATGAC 


AATCGGATCC 


1200 


AATTAAGTCG 


ACAACCATCC 


GGGCACATAT 


TTCTCTTGTA 


TATTTCATCA 


CATATCCAAT 


1260 


CAATATTTAA 


CCCGNAAAGG 


AAACCTGGGT 


AATAAATCCA 


TACCACCGCA 


ACGCTCGGGC 


1320 


CGG TTATTAA 


CTTAATGGAT 


ATTCTGGGGG 


ATACAACCGA 


TTAGGTTAAT 


GCTTGTTGAA 


1380 


TCCCAGGCGG 


GAGGGG AC AA 


CAACCTAGGA 








1410 



(2) INFORMATION FOR SEQ ID NO: 118: 

SO 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 926 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

5 



10 



IS 



20 



25 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 118: 






AATTATTCCG 


GTTTCGC CGT 


CTTTGTTTGT 


GGCTATGTTA 


CATTCAACAA 


TAGATTTGCC 


60 


AGTGATACTG 


X IV. 1 1 I 




■rt. 1 Art 1 rt 1 1_ AV 




GCATTGCTAA 


120 


ACTTGCATCG 


GCTTCTATTC 


CGCCTGATTC 


TTTCATGTCT 


GATAGCATTG 


GTCGTTTATC 


180 


CTGCCTAGAT 


TCAACACCAC 


GATTCAGTTG 


TGAAAGT AG T 


ACGATGATTG 


CGCCTGTCTC 


240 


GTTAGCGATT 


ATCTTTAAGT 


CACGTGATAT 


CTTTTCT AC T 


GCTACACGTC 


TATCAACTTT 


300 


CGCATCAGTA 


TCCATCAGTT 


GAAGATAATC 


TATAAAAATA 


ACTTGGTTGC 


TCGGTCTGAA 


360 


TGCCTCATTG 


GTTTGAGCTC 


GCACATCTTG 


CGGTGGTAAT 


ATTTACTTTT 


GGTCAAGAAA 


420 


TATCAAATAC 


CTAACTTCAA 


TGATTTGGAT 


CCCATTGCAT 


TTGTTAACTT 


CGTTAAGTCA 


480 


TCCGGTGTTA 


AGTTCCTGAT 


TTCTTTTATC 


TTGGTTAACT 


CAATACCAGT 


AATTG TTGAT 


540 


AACATACGTT 


TCAATACTGA 


TGTGCCAGTT 


GTTTCAAGAC 


TAAAGAAAGA 


TGTTTTATAT 


600 


CCATTTCGTG 


CTATGTTCAA 


CATCATGTTT 


AATGCAAAAC 


CTGTTTTACC 


CACTGAGGGA 


660 


CGCGCGTGCC 


ATGACGATTA 


ATTGCGACGG 


CTCTAATCCC 


CCTATTTTGT 


AATCCATGAG 


720 


CTTATAACCC 


GTCTTAATTT 


GCTTCTTAGG 


GCTATCGCTG 


TATAACTCAT 


CAACAAAC TC 


780 


CTCAACAAAC 


TTCTTGGTTC 


CATCTTCTTT 


TTTGTTAGTA 


ATCGTTTTTA 


AATCCTTGAG 


840 


TTCATCAATC 


AAGTTGTTAA 


AGTTTTGGTT 


CGTAGGTTGT 


TGTTTAAACT 


CAGTTACCAA 


900 


TTCGTTAGCT 


TTGTTGAGCT 


CGTGCC 








926 



(2) INFORMATION FOR SEQ ID NO: 119: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

35 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



45 


CCAATAATCC 


ATTNCAAGGC 


TTTTTCGTTG 


CCCCAATNAA 


GAAGCACGAG 


ACCCATCCCG 


60 




GCTATTAAAT 


ACCTGCAATT 


TGAAATTTGC 


CAAGGGCACC 


ATGGTGTGAC 


CG AC AAG TAT 


120 




TAAAATCCTG 


CACCAGCGAA 


AAGCATTACC 


TAATGAATTA 


TTATCATTAA 


TCGATATTAT 


180 




TGTGCCAAAC 


GAAACAGAAG 


CCGAATTGTT 


ATCTGGGATT 


AAAGTAACTA 


ATGAACAATC 


240 


SO 


TATGAAAGAC 


AATGCCAATT 


ACTTTTTATC 


TTTAGGCATT 


AAGACTGTTT 


TGATTACGCT 


300 




AGGTAAGCAA 


GG T AC AT ATT 


TTGCTACTAA 


AAATCAAAGC 


CAACACATCG 


AAGCTTATAA 


360 




AGTAAATGCG 


AT TG AT AC AA 


CTGCTGCAGG 


CGACACATTT 


ATTGGTGCAT 


TTGTCAGTCG 


420 




CTTAAACAAG 


TCGCAAGATA 


ACTTAGCAGA 


TGCTATTGAT 


TTTGGTAATA 


AAGCGAGCTC 


480 


55 


ACTCACTGTA 


CAAAAAACAC 


GGCGCGC AAG 


CATCTATTCC 


TC T AC TAG AA 


GAAGTAAATC 


540 




AAGTTTAAAT 


GAATCAAACA 


CAGCTATGAT 


ATGAAGGTTT 


AGCATATAAC 


ATGCAACATT 


600 



108 



EP 0 841 394 A2 



5 



10 



15 



CGTATATCAT 


GGCTGTGCTT 


TTTTATCTTT 


ATAAAACATC 


ATCTATTAGA 


AATAATTTTC 


660 


CACTAAACCT 


ATGCTTGTTG 


ACTCATGTTT 


AGTTATAAAT 


GAAGTGACAA 


TTTTTTGTAA 


720 


TCTTTTTAAC 


TTCCAAATTA 


TTGCATATAA 


ATATGCTATA 


TTAATGATAA 


TAATTATCAA 


780 


TTAAAAGGAG 


GTTATGCTAT 


GTCTAAAGAA 


GCTGGTCATA 


CATTTTTAGC 


TAAATTAGGA 


840 


AAAACTCGTC 


TACGCCCCGG 


TGGGTAAAGA 


AGCAACAGAT 


TGGNTTAATA 


CANCAAGGGG 


900 


CATTTTCACA 


AGATAAACAA 


G TGTTAGAAG 


TGGCATTGTA 


ATATTGTGCA 


CAACATCTAT 


960 


TTATCTAGCT 


CATACATATG 


GCTGTCACAT 


TCAAGGCGTT 


GGATATAAAT 


AAGAAAGCAT 


1020 


TAGAAAAAGC 


ACAGGAAAAT 


TTCCAGCAGC 


AGGTCTTGAA 


TCATATATTC 


AAGTTCAACA 


1080 


AGCGAATTGC 


TGTTAAATTG 


CCCTTTGATG 


ACAATCAATT 


CNATGTCGTT 


TTAAATGAAG 


1140 


CAATGTTAAC 


AATGTTACCC 


ATCGCCATAA 


AGGAAAAAGC 


ATTACGCGAG 


TNCTACCGAG 


1200 


TCATAAAGCC 


NGGGGGTNTN 


TTGTTAACCA 


CATGATNTTG 


TCATCGTTAA 


TGAATCANAT 


1260 


NCCACANATG 


TTGTTAAATC 


ATTATCTGCA 


GCAATTAATG 


TCNATGTCTC 


AC CGC AGACG 


1320 


AAACTTGGCT 


GGTTAGATTT 


ATATCATCAA 


AGCTGGTTTT 


AATCAGGTGC 


ATTATCCATA 


1380 


CTGGTCCATG 


AGTTTAATGA 


CACCAA 








1406 



20 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



35 



40 



45 



GGACGTAATN 


ATACGCCTTT 


TCTGATTATT 


TCTTTACTAT 


TCTAAAAACT 


GTTGTTCTAC 


60 


TAATACCTAC 


CATTTTGAAA 


TTTCATTTGG 


CGTGTACATT 


CTACTTGCGA 


ATAATTGTTT 


120 


AATATATTTA 


CGTTTATCTT 


CAGAAAGTTG 


AGGTCGTCCT 


CCATACCTAC 


CACGTTCTTT 


180 


TGCAGATTCT 


AGACCATTTG 


TTGTTCTTTC 


TTTTAATAAT 


AT ATTTTC TG 


AATCTGCTAA 


240 


CATTTTAAAT 


ACTTCAATAA 


ATGATTGTCC 


TTCTTCACTA 


TTAGTATTTA 


TGCCTAAATC 


300 


AAGCACTTCT 


AAATGGATAT 


CATATGGTAA 


TATTCTACTT 


TCAAAAAATT 


CCATGAGTTT 


360 


TTTTGGTGTG 


AATCC TAGT A 


CTTTTAATTT 


TTCTATC ACT 


AATGTATCAT 


CTGTTTTCAA 


420 


CTCGTCTAAA 


CAATGCTCAA 


GTTCTAGTTG 


TTCTTTATAA 


TAAACATTTA 


CTTGCTCAGA 


480 


ATAAACTAAG 


TC AC AAC CAT 


TTATTAAGAG 


TTTCGTTAAT 


TGAGTATCTA 


GATTTTGATC 


540 


AGGATATGCT 


ACACGTGCGT 


AACCAATTTT 


CCTCATTTTA 


TACACCCTTC 


CAGCTTATTC 


600 


ATTTTCTCCG 


ATTAATAAAC 


CTTCGCTCGT 


GCCGTTTCAA 


CAAGTCGGGC 


CCTG 


654 



so 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 1213 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121; 



10 



15 



20 



25 



30 



GCAAAGGCTT 


TGAAGATATG 


AGTGGTAAAG 


AATGTACCTA 


AGGATGACAT 


TGAATAGAGG 


60 


GAGTAAAGAA 


AGCGACAGCT 


GATGCATCCT 


CGCTAGTTTG 


GACGGCATGA 


ATAGAATGAC 


120 


GTTAGGAAAT 


GAAGTTCTGA 


CTAACAAAAA 


GATTGAAACT 


GAAATCAAGA 


TGCTTGAGAA 


iao 


AAAAATTGAC 


CAAATGGATA 


AATCAGAAAA 


TAATTCACAA 


GAAGCAGAAG 


TTGCTAAAGC 


240 


ACTTATTAAG 


TTAGCGGGTG 


TTAATAATGA 


TTAATGAAAT 


GTTAAACCCG 


AAACAACAAG 


300 


AAGTCTGGAC 


CTGCTTTATA 


AACGATAAAC 


CCAAAGTATT 


AATAGCGAGT 


GG TGCCAAAA 


360 


GGGCAGGTAA 


AACATATGTG 


TTCATCCTGC 


TTTTTTTAAT 


GCATATAGCT 


ACTTATAAAG 


420 


ACAAGGGGCT 


TAACTTCATT 


ATTGGAGGAG 


CAACACAAGC 


ATCTATAAGA 


CGTAACATAC 


480 


TAGATGATAT 


GGAGTTAATA 


CTAGGTAGAG 


AGTTAACACT 


CGACAAATCT 


AACGCAGTCA 


540 


AAATATTCGG 


TAATAAAGTG 


TATGTATTCG 


ACGGACAAAA 


CTCGGATGCA 


TGGAAAAAAG 


600 


CGCGTGGTTT 


TACTTCAGCA 


GGTGCTTTTT 


TAAATGAGGG 


AACAGCATTA 


CACAATATGT 


660 


TTATTAAAGA 


AGTGTTCTCA 


CGTTGTAGTT 


ACAAAGGCGC 


GAGAATATTA 


ATTGATACCA 


720 


ACCCCGAAAA 


CCCGATGCAT 


CCAGTTAAAA 


AAGATTACAT 


TGATAAGAGT 


GGTCAACGGT 


780 


TATCGAATGG 


AAGACTAAAT 


ATCAAAGCAT 


TCCAATTTAC 


TTTGTTTGAC 


AATACATTTT 


840 


TAGATGAAGA 


ATATATTGAA 


TCGATTATCG 


CAAGTACACC 


AACAGGAATG 


TTCACAGATC 


900 


GTGATATTTA 


TGGTAAGTGG 


GTTTCTGCTG 


AAGGTGTTGT 


ATATAAAGAT 


TTCAAAGAAA 


960 


AAGTTCATTA 


CATCACAGAA 


GAAGAATTTA 


AAACTAAACA 


AATAAAAAGG 


AAATATGCAG 


1020 


GCGTCGACTG 


GGGATATGAG 


CATTATGGTT 


CTATTATGGT 


TGTAGCGGAA 


GACTTCGACG 


1080 


GAAACAAGTA 


CGTTATTGAA 


GAACACGCAC 


ACAGACATAA 


AGAAATAGAT 


GACTGGGTAG 


1140 


C T ATTGC AAA 


AGAGTTATAA 


AAGCATGGCG 


ATATCCTTTT 


TATGTGATAC 


GCTAGACCTG 


1200 


ACATATTGAC 


ATT 










1213 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

CATTATTATT TGTCGTTAAA TACAATAGAA AATACTATAC CTGTATATGC AATTCGACAA 60 

TAGATAAATT ATTAAATATG CTTACAACAA TCTTAATATC CTTTAACGCA CTACAATAGT 120 

GCTCTGATAA TAGGTTATAA ATGTACGTAA AACCATTGTC TCAATAAAAA TGAAAACGTA 180 

TACTTCAAGA AGGATGGGTT ACTTAATATA AACAAGGGGG TAACATATAT GAC TTT AT AT 240 
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TTAGATGGTG AN AC AC T AAC AATTGAGGAT 
ATTGANATTA TTGATGATGC GTTAGAACGT 
ATTATTGAAA ATGANGAAAC GGGNTTGACG 

S AGTGATGTAC GTATTAGCAC CCGACGCAAT 

CACATGCCTG TGGACTAGGT GAGCCATTTC 
NACGATTGAA TACATTATTA AAAGGTCATT 
TTACAATTTT TTATAAATGA ACGTATTATA 

to GCATCAGGAG ATTTAGCGCC ATTATCACAT 

TATTGTACAG AGGGGAAGAA AAGGATAGTG 
CTTTGAACCT TCAGGCTAAA GAGGGTTTAG 
CTCAAGGTGT CATTAGTTAT ATAGAAGCAG 

15 CTGCATTAAC GCATCAGTCT CTTAATGGCA 

CTGTTCGTAA TTTTCAAGAA CAGATTAATG 
GATCAACATT AACGACGCGA C AAGC AG AAA 
GTATACCACA AATCCATGGC GCTGGTGCCC 

20 A 



ATTAACTCAT TTTCACAACA ACACTCAAAG 300 

NTCAAAAAAA GTAGAGCGGG AGGTGAACGT 360 

GGTATCACTA CAGGGTTGNG GGGTTTATTT 420 

ATNATGAATN ACAAGTGAAT CTGATACGCT 480 

CAAAAGAAGT AGCATTAGTC ATGATGATTT 540 

C AGGTGC C AC TTTAGTAATT AGTGAGACAA 600 

CCGATAATCC CACAACAAGG CTCTCTCGGT 660 

TTAGCATTAG CATTAATTGG TGAAGGGAAG 720 

ACGATGTATT AAGAGAATTA AATAGACAAC 780 

CATTGATTAA TGGTACGCAA GCTATGACAG 840 

AAGATTTAGG TTACCAATCT GAATGGATTG 900 

TTATAGATGC ATATCGACAT GATGTGCACT 960 

TGGCAGCGCG TATGCGTGAT TGGTTAGAAG 1020 

TACGTGTACA AGATGCATAT ACGTTGCGTT 1080 

AAGTATTCAA TTATGTTAAA CAGCAATTAG 1140 

1141 



(2) INFORMATION FOR SEQ ID NO: 123: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



40 



45 



50 



55 



TTTGTGCTTC 


GTCAATCTTG 


TCACCANCTA 


ACTCTAAACG 


CTTTC CTTTT 


GTCGTTTTCG 


60 


GGNTCCCACC 


ACAATTAATC 


TGTTTTGATT 


TCTGACATAT 


CAATCTTCCT 


AGTTAACCCA 


120 


AGCCAAATCT 


TTACCATACT 


CAATTTCTTT 


ACCTTGGAAC 


AAGCTGTTTT 


TCTTGTTGAG 


180 


TACCACATAT 


CT AC CTTTGA 


CGGTATTAGA 


ACT AAGC TC T 


ATATAAAAAT 


CCAATGCCAT 


240 


TTTATAGGTT 


GTACATAATT 


GCTTTAAAAC 


TTCATATCTA 


GTNTGATAAG 


AAGTCCATGA 


300 


CGTAGTACGT 


AAGCCATCGT 


ATTCGGTNTG 


TTCAGAAACT 


TCCCAACCTG 


TATCGCTCAA 


360 


CACATCTTTC 


AATGCTTCTG 


AAGTTGTCTT 


TTTCTCAAAT 


TTGCCTGGTG 


CATACGGTTT 


420 


AGCTGTTGTT 


ATATCAGCAA 


GATAAGACGC 


TATACATTCT 


ATCTCTGTGT 


AGCCGTCCAT 


480 


CGTATCTTGA 


ACCCAGTTAA 


TAATAAATTC 


ACGCCATTGT 


TTGTTTGAAT 


CCCTTATAAT 


540 


AACACGATGT 


CGTTCACGGA 


ACNTTTCAGC 


TCTTTCTGAT 


GATATGAGCA 


GTTCAAGCAT 


600 


TTCTGAATTG 


TCATTAACAT 


TACGTTTATG 


AATCGCTCTA 


ACTAAGGAAG 


GATCATCAGT 


660 


AGAAAGGAAA 


TCTATAATCT 


TGTCGTTAAA 


ATCTAAAACA 


TGTATCACAC 


TCTCATCTCC 


720 


TTTCTATAAA 


TATCTATCTT 


GCCATTTAAC 


CGTCGTATCA 


AAGACGTTTT 


CAGGTTGTAT 


780 


GATTAATTCA 


CTGTACCCAG 


AATCAACATT 


GAAATAATTA 


CTTCCAAACG 


ATTTCTCGCT 


840 


CCAACATTGG 


TTCCTCATTT 


GATGACAACA 


CTTNNNGCTT 


GNATATCTAT 


TTTC AC T AAA 


900 
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5 



TCACCTTTTT 


GTATAATGAC 


ATCCC TTGCG 


CCTTTCGGNT 


TCGGTAGAAT 


CTCCGTATTG 


960 


AATGAACCTA 


ATCCATTCAT 


CTCCATCCAC 


TTATAACCGT 


NATACTTCGC 


AC T ATAGAT A 


1020 


GCTATGATAG 


AAGCTGGACG 


CTGATAAAAC 


TTACCGCCAT 


CTATCCACTC 


TTTCTCATCC 


1080 


ATATCAATAG 


GTTTACGTCT 


ATCTGGGTCT 


TTAATGTGAT 


CAAATTTCCA 


AGTTTTAATA 


1140 


GAAAATTTAT 


TACCTACTCT 


TCTGAGCCGC 


ATATAAACAN 


CGATTCTGTC 


CAAGTTATAC 


1200 


ATTATCGGTT 


TATTCTGATA 


GTCGTATATC 


TTTTTGGGGT 


CTCCTTTTTG 


GTTATACG 


1258 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 



30 



35 



40 



45 



50 



55 



TCGAATTTGG 


TGTTGNCCGT 


TCCTGTATNA 


CATTCTGNCT 


TATCTAATGG 


GGGACGTTAT 


60 


GATGAGTGCC 


CACNNATTAG 


GGATGATCGT 


NCGAGAGTAA 


GTGTTGGTGC 


AAGGTCAAGT 


120 


GTGTTCGCAC 


CTTTCCACAA 


ANTTAGGGTT 


AATCATCATT 


GATGAAGAAC 


ATGAATCTAC 


180 


ATATAAACAA 


GAAGATTATC 


CGAGATATCA 


CGCTAGAGAA 


ATTGCCCAAT 


GGCGAAGTGA 


240 


ATATCATCAC 


TGTCCAGTCA 


TTTTAGGAAG 


TGCAACACCA 


TGTCTTGAAA 


GTTATGCACG 


300 


AGCTGAAAAA 


GACGTTTATC 


ATTTGCTATC 


ATTACCAAAC 


AGAGTGAACC 


AACAAGCTTT 


360 


ACCTGAAATT 


GATATAGTAG 


ACATGCGTGA 


AGAATTGAGT 


GAAGGTAATC 


GGTCAATGTT 


420 


TTCAAAAGAT 


TTACGTGAAG 


CCATACAATT 


AAGATNAAGA 


TCGACAGGAA 


CAAGTTGTTT 


480 


TATTTTTAAA 


TCCGACGTGG 


TTATGCATCG 


TTTATGTTAT 


GTCGGGATTG 


TGGATATGTA 


540 


CCGCAATGTC 


CAANCTGTGA 


TATTTCATTA 


ACGTATCATA 


AAACGACAGA 


CTTATTAAAA 


600 


TGTCACTATT 


GTGGTTACCA 


AGAGACGCCA 


CCGAATCAAT 


GTCCAAATTG 


TGAGAGTGAA 


660 


CACATTCGAC 


AAGTAGGTAC 


TGGTACTCAG 


AAAGTTGAAG 


AACTATTGCA 


ACAAGAATTT 


720 


GAAGATGCGC 


GCATAATTAG 


GATGGATGTA 


GATACAACCT 


CAAAGAAAGG 


TGCACATGAA 


780 


AAGTTATTGA 


CTGAATTCGA 


AAAAGGTAAC 


GGTGACATTT 


TACTAGGTAC 


TCAGATGATT 


840 


GCGAAAGGAT 


TAGATTATCC 


AAATATTACT 


TTAGTTGGTG 


TGCTGAATGC 


NGATACAATG 


900 


TTAAATTTAC 


CTGATTTTCG 


GGCGAGCGAA 


CGTACTTATC 


AAC T ATT AAC 


GCAAGTGGCT 


960 


GGTAGAGCTG 


GTCGTCATGA 


AAAGGCAGGT 


CAAGTCATCA 


TTCAAACGTA 


TAATCCAGAT 


1020 


CATTATTCAA 


TATTGGATTG 


TTCAAAAAAA 


TGATTATTTA 


ACATTTTATC 


GTCAGGAAAT 


1080 


GGAATATCGT 


CAAATTAGGA 


AAGTATCCAC 


CGTATTTATT 


ATTTGGTTAN 


TTTCACAATC 


1140 


TCACATAAAG 


AAATGAAGAA 


GGTTATGGAA 


GCATCGCAGC 


ATGTTCATAA 


AATTTTATTA 


1200 


CAGCATTTAC 


AAGAAAAAGC 


GCTTGGTACT 


AGGTCCATCC 


TCCGGCAGCA 


CTTGCGAGAA 


1260 


TCAACAATGA 


ATTTAGATTC 


CCAAATTTTA 


GTGAAATATA 


AAAGTGAACC 


TGGATTATTA 


1320 


CAAGCCANTC 


AGTTTTTAGA 


TGACTATTAC 


CATGAAAAAT 


TTATAAAAGA 


AAAATTAGCA 


1380 


TTGAAGATTG 


ATATTGGANC 


CACAGATGAT 


GATG TAAC AT 


TACTAATTAT 


TAGAAACAAG 


1440 


NNCAAGTATT 


GTACGAGTAT 


TTGAACCCAG 


TGTGTGAATA 


TTTACTTNAN 


GNACAAGAAA 


1500 
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AAGGGCAGAA TATACAACTG TTAACTATTT AAATTAGCAG TTTATATTCT GTCTTTTTAT 15 6 0 
ATGGCTTTAT AACTTACGTG ATTTTGGTTT GATAAGGAAT TTATTAGTAT TTTCATTTAC 1620 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



25 



30 



35 



AATTGGGCAC 


GGGTANACAG 


TTGTAGCGTT 


ATTAAGNATT 


TTAACTTTTG 


CGAAGTTAGC 


60 


AACAATTTNC 


CAATGATGTT 


ATTTTATCAA 


TGTTACCTCA 


AGC AGC AAC T 


ACAANCGATT 


120 


GCGTTAC CAG 


TATCAGCTGG 


TATCGGTGGT 


ATAAAAGAAT 


TAACATCATT 


AGCAGNTATT 


180 


TTAAATGGTG 


TCATTATTTA 


TGCCCTAGGT 


AATAAATTCN 


TGAAGCTTTT 


CCGAATTACT 


240 


AACCCTATTG 


CCCGAGGATT 


AGC AC TTGG A 


ACNAGTGGTC 


ACACATTAGG 


TGTAGCACCA 


300 


GCCAAAGAAT 


TAGGACCTGT 


AGAAGAATCA 


ATGGCAAGTA 


TAGCTTTAGT 


GTTAGTTGGT 


360 


GTAGTTGTTG 


TAGCAGTTGT 


GCCTGTCTTT 


GTAGCAATAT 


TCTTCTAAAA 


CGAAAAACCT 


420 


AAGCAAGATA 


ATAGCAATTT 


GAGCCATTGT 


TATTATCGTA 


AAAAAACGTC 


TATACTCCAG 


480 


TTTATAACTG 


GGATATAGAC 


GTTTTTATGT 


ATTTATTACT 


TTTTACTAGG 


AATATAAAAC 


540 


TGTGCATGNC 


GATAATGAAA 


TACGATGTCA 


GACGAATCAA 


AGGGTTTGCC 


AGTCATTGTA 


600 


TAAAAAGTCT 


GGTGGTAACG 


TAAACATGGT 


TCACCTGTAG 


ACAATTGTAG 


TAATGAAGCT 


660 


TCACTTGAAG 


TGAGTTTATC 


TACATTAAAG 


AAAATATCTG 


AAAAACCAAT 


ACGAAGTTTC 


720 


ATGTTTGATT 


CTAAATAATC 


GAAGATAGAG 


CCCTTAGCAA 


TATCATCATT 


TAAATATTTC 


780 


ACGATTTCTT 


TATGATAATA 


AGAATATTCG 


ATACATAAAA 


CATCATCGTC 


CACGAATCTT 


840 


AATCGCTCTA 


AATAGTAGAC 


GGTATCACTC 


GTGCC 






875 



40 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

45 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

55 

CAACTAGTGA TTCAGGATAT AGGAATTGTA TTGTCGGAAT TCANACAGGT GGATATGGTC 60 
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5 



TO 



AAGATTTAAA 


AAGATTATAA 


CTTGGCCCAA 


TTATNACGGN 


GATCTTGAAA 


ACGATTANTG 


120 


GATTAGAACG 


AATTCGAATT 


TCTTCAATTG 


AAGCAAGTCA 


ACTTACAGAT 


GAAGTAATTG 


180 


ACGTTTTAGA 


ACGTTCAACA 


AAAGTTGTGC 


GTCATTTGCA 


TATTCCATTA 


CAATCTGGTT 


240 


CAGATACAGT 


ATTAAAACGT 


ATGAGACGTA 


AGTATAC AAT 


GGATAGATTT 


TCAGAACGAT 


300 


TAACAAAATT 


GCATAAAGCT 


TTACCAGACT 


TGGCAGTTAC 


GAGTGATGTA 


ATTGTTGGTT 


360 


TCCCAGGTGA 


AACTGAAGCA 


GAGTTCCAAG 


AAACATATGA 


TTTTATCGTA 


AAGCATAAGT 


420 


TCTCTGAACT 


GCATGTTTTC 


CCTTATTCTC 


C T AGAATTGG 


CACGCCAGCT 


GCAAGAATGG 


480 


ATGACCAAAT 


TGATGAAGAA 


ATTAAAAATG 


AACGCGTGCA 


TAAGGTTAAT 


TACGCTTAAG 


540 


CAATCAACTC 


GGAAAGTTAT 


ATGCGTCTTA 


AATTGGATCA 


AAGATGTGCT 


TGGANGGTNA 


600 


TTCCCTGAGG 


GAACAAGGGT 


GGACACAAGA 


AGGGTACAAT 


TAGGTGGRRD 


CTNS 


654 



{2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



35 



40 



45 



50 



55 



CTANCNAANG 


GAANTTCAGC 


ATCCTTAAAA 


ATACCTATTT 


GACTGTAGAA 


ACCTTTTGNT 


60 


GCGTACAATA 


TCTAAACCTT 


GTCGTGCTGC 


TGGAACTGCA 


CCTGAACATT 


CAACAACAAC 


120 


ATCTGCACCG 


TAACCGTCTG 


TAATTC C ATT 


GATATACGTT 


TTTAAGTCTG 


TGTGTTGTAA 


180 


ATTGACTACA 


TAATCCATGT 


GCAATGCTTC 


TGCTTTATCT 


AATCTGACTT 


NGTGGCANTG 


240 


TCCAATCCAG 


TTACCACAAC 


AGGTGCGCCT 


TTACTTTTCA 


ACACTTGTGC 


TACAAGTAAT 


300 


CCGATTGGCC 


CAGGTCCCAT 


TACAACTGCT 


ACATCGCCAG 


AGTTCACTTG 


AATCTTAGAA 


360 


ACGCCATGAT 


GTGCACATGC 


TAATGGTTCT 


TGTCATAGCT 


GCAGACTGAT 


ACGATACTTC 


420 


CGCTTCTGGA 


ATATGATNCA 


AACTTTCTTC 


ACGTGCAATG 


ACATAATTAG 


TAAATGCGCC 


480 


ATCAACTTGT 


GTTCCAATAC 


CTTTTCGATG 


GTTGCATAAA 


TGATAGTTTT 


TTGATTTACA 


540 


GGAATCACAC 


TCATTACANA 


CCATAGAATG 


TAGTTTCAGA 


AGTGACNCGG 


TCACCAACTT 


600 


TAAAATCNTT 


AACGTCTGCT 


CCCAACTTCA 


ACGATNTCAC 


CAGAAAATTC 


ATGACCTAAT 


660 


GTCACTGGAA 


AATTAACTTN 


ATAATGCCCT 


TCATAAGTAT 


GAAGGTCTGT 


GCCACAAATT 


720 


CCTGCATAAT 


GTACTTTAAT 


CTTTAC TTT A 


TCATCTAGCG 


GTGTTGCAAC 


TTCTTTATCA 


780 


AGAAGTTCTA 


AGTTGCCATG 


TCCTTCTCTT 


GTTTTTACTA 


AAGCTTCCAC 


CACAAACACN 


840 


TCGANTTTTT 


ANTTGNAATA 


GACTNNATAG 


NTTNAAGATA 


AGATAGTTAN 


CGATATTNCC 


900 


ACCTTGATCA 


ATACTTGANA 


TTTCAGATGA 


ACCTTTTGNC 


ATTTGTACAT 


TCGTACCTTT 


960 


CGCCATATCT 


GTGAAAATGG 


GTGC T ACGTC 


TGTTGCAATA 


TATAATGAAA 


TTGCAATCAT 


1020 


AATCGTACCC 


ACAATGACAG 


AATGAATAAT 


GTTTCCTCTT 


GCTGCACCAA 


CAATAAACGC 


1080 


GACAACAAAT 


GGTATAGTTG 


CTAAGTCACC 


AAAAGGTAGT 


AC TTGGTTTC 


CTGGTAAAAT 


1140 


AACGGCTAAT 


AAAACAGTGA 


TAGGTACTAA 


AATTAATGCT 


GTCGAAATAA 


CCGCTGGATG 


1200 


ACCTAATGCT 


ACAGCCGCAT 


CCAATCCAAT 


ATAAATTTCA 


CGTTCGCCAA 


AACGTTTATT 


1260 
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TAGCCATGTT CTTGCAGACT CTGAAACTGG CATTAAACCT TCCATTAAGA TTTTTACCAT 1320 

TCTAGGCATT AAGACCATTA CTGCAGCCAT TGACATTCCT AAATTAATGA TGTCTCCAGG 13 80 

TTTGTAACCT GCTAACACAC CAATACCTAA ACC TAAAATT AAGCCGACAA ATATAGACTC 1440 

TCC 1443 



(2) INFORMATION FOR SEQ ID NO: 128: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



25 



30 



35 



40 



45 



CTTAGCAACT 


CAAGAAAACG 


TTAATTTATC 


AATTCGCGGA 


ATTCCTAAAG 


AAGTAGTAGC 


60 


ACGTGGTGCA 


CATCCATGTA 


TTATTTCTAT 


GGAGGGTCTT 


GAAAAAGAAG 


GCGACACTTA 


120 


TGTCATTCCT 


CATGTACATG 


AATTATTGAC 


ACCATTAGTT 


TCAGTAGTTG 


CATTACAATT 


180 


GATTTCATAT 


TATGCAGCAT 


TACACAGAGA 


TTTAGATGTG 


GATAAACCAC 


GTAACCTTGC 


240 


TAAATCAGTT 


ACTGTAGAAT 


AATTCACTTT 


TTTAGAATCA 


ATCATGTATT 


AAAATTAAAG 


300 


TATATGGCAC 


CCTTTTAGAT 


TAATCGACTA 


GAAGGGTGCT 


TTTTTAGGTC 


GACTTAGCTT 


360 


TTAC TTCATC 


TTAATTTGGC 


AGAAATGCGT 


NAAAAGGAAG 


TGTTTTATTT 


ATTTAAATAG 


420 


TCTGACAATT 


AAGGGTGTTA 


TGTTAATATG 


ATTTTATGAG 


AAGTATGGAG 


TAGCNATAGA 


480 


GGGGTGACCT 


CGCATGTTAA 


TTCAATTAAG 


ATCCACATTG 


GGCGANTGAA 


GCAAGGAAAA 


540 


ACAATTTTAA 


AAAATATTTC 


TTGCCACATT 


CCTAAAGGTG 


ATAAATGGAT 


ATTATATGGG 


600 


TTGANTGGTG 


CTGGTAAGAC 


AACGCTATTA 


<> 

AATATATTAA 


ATGCATATGA 


GCCTGCGACA 


660 


ACTGGCGGTG 


TTAATCTATT 


TGGTAAAATG 


CCAGGCAAGG 


TAGGGTATTC 


TGCAGAGACT 


720 


GTNCGACAAC 


ATATAGGTTT 


TGTATCTCAT 


AGTTTACTGG 


AAAAGTNTCC 


AAGAGGGTGA 


780 


AAGAGTAATC 


GATGTGGTGA 


TAAGCGGTGC 


TTTTAAATCA 


ACTGGTGTTT 


ATCAAGATAT 


840 


TGATGATGAG 


GTACGTAATG 


AAGCACATCA 


TTTACTTAAA 


TTAGTTGGAA 


TATCTGCTAA 


900 


AGCGCAACAA 


TATATTGGTT 


ATTTATCTAC 


AGGTGAAAAA 


CAACGAGTGA 


TGATTGCACG 


960 


AGCTTTAATG 


GGGC AACCTC 


AGGTTTTTAA 


TTTTAGATGA 


GCCAGCAGCT 


GGTTTAGACT 


1020 


TTATTGCTCG 


TGAGTCACTA 


TTGAATATAC 


TTGACTCATT 


GTCAGATTCA 


TATCCAACGC 


1080 


CTTGCGATGA 


TTATGTGACG 


CAC 








1103 



(2) INFORMATION FOR SEQ ID NO: 129: 

SO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE ; Genomic DNA 



10 



15 



20 



25 



30 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:129: 






CTGACATATT 


TAATGTTTAC 


ATCTAGTGCA 


TCAGGTTACG 


CCGTNTGTAT 


AGCGATGTTG 


60 


AGACGTTTAA 


CTGGATTAAC 


TAATATCATT 


GGTAATTTTT 


ANCCAAAANA 


TATTGNTNGG 


120 


GTTTAATTGG 


CCCGAGTACT 


TTTACCANTA 


TCATGGTTAA 


NTAGTATTTT 


GNTGTGACTC 


180 


NAGGTGTACC 


NCCCAAACCG 


TTGCATGCTA 


ATTTAATGAT 


TCGGACTTTA 


AGCGGACATA 


240 


TC C AAC AT AT 


TGCATTTGGA 


CCTATTGCAT 


CACTTGAATC 


AATAAAACAT 


CTTGGTACGA 


300 


ATGG TGGAGG 


ATTTTTAGCA 


GGAAATTCTG 


CAACACCTTT 


TGAAAATCCA 


AATATTTGGA 


360 


GCAATTTTAT 


AGAAATGGGC 


AGTATGATGT 


TACTTCCTAT 


GTCAATGTTG 


TTCTTATTTG 


420 


GTCGCATGTT 


AAGTAGACAT 


GGTAAACGAG 


TACATCGTCA 


TGCGTTGATA 


TTATTTGTCG 


480 


CAATGTTTTT 


CATTTTTATA 


GCAATTCTTA 


CATTAACTAT 


GTGGAGTGAG 


TATCGTGGTA 


540 


ATCCAATACT 


AGCGAATTTA 


GGCATTTATG 


GACCGAATAT 


GGAAGGTAAA 


GAGGTACGGT 


600 


TTGGAGCAGG 


TTTGTCAGCA 


C T ATTTAC AG 


TTATTACGAC 


GGCATTTACA 


ACGGGTTCTG 


660 


TTAATAACAT 


GCATGATAGC 


TTAACGCCTA 


TAGGTGGATT 


AGGACCAATG 


GTATTAATGA 


720 


TGCTAAATGT 


TGTATTTGGT 


GGCGAAGGCG 


TAGGACTCAT 


GAATTTATTG 


ATATTNNTCT 


780 


TACTGACGGT 


GTTTATATGC 


AGTTTGATGG 


TTGGTAAAAC 


ACCAGAATAT 


TTAAATATGC 


840 


CAATTGGCGC 


CCGTGAAATG 


AAATGTATTG 


TCTTAGTCTT 


TCTCATACAC 


CCAATTTTAA 


900 


TTTTAGTATT 


TTCAGCACTT 


GCTTTTATGA 


TTCCTGGAGC 


AAGTGAAAGT 


ATAACGAATC 


960 


CGTCTTTTCA 


TGGTATTTCA 


CAAGTTATGT 


TATGAAATGA 


CATCAGCTGC 


TGGCGAACAA 


1020 


TGGATCAGGG 


TTTGAAGGAC 


TGAAANGAAT 


GAATNCAACA 


TTTCTGGGAA 


ATATCTCTAC 


1080 


AGGAATCATT 


AATGT 










1095 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 





GGTGTTGTTT 


TCCTCCAATT 


TTTGGATCTT 


TTTTGCTCTA 


AAATCTCATC 


TTTTTTAATC 


60 


50 


TAAATAGATC 


AAGAGTGCGC 


TCTAAGTTTT 


CTTCTTTTTG 


AAGAAGTCGG 


GTTTCCTTGT 


120 




CTNTGAAGAT 


CNCAACGTCT 


TTCTCGNAGT 


TCTGCTTCAG 


TNTGTTCTCT 


TNGGAGTTGG 


180 




TNTTCTTCTT 


TTGCNTCAAG 


TAATTTCTCT 


TTTTTGATAN 


TGTCAGCTTC 


TTTATGTGCT 


240 




TGATNAACAA 


TATCNTCGGC 


AGTTTGTCTA 


GCTTGTGATT 


GCTTTTGAAG 


CANCANATTT 


300 


55 


CGGGCNACAA 


CATACCCTCC 


AACAACTCCT 


AGAATGATCC 


CCANCACATG 


AGTAGGAGGC 


360 




TTAATAAATT 


CACACACACA 


CCTCCTTTTC 


TAGGGGTTTG 


CTCCTTTAAT 


ATGAAATCCA 


420 
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ATATGACTTT 


ATGAAATTAA 


AAATAAATCA 


AACCAAATTA 


ATTTGTGCGG 


TTTAATAGGA 


480 




GATAAGTGTC 


AAGTTATCAA 


TTTGCACTTT 


TGCACTATAT 


TGAATGCTAT 


TTCTACTATT 


540 




TTCTAAAGTT 


TTGAAGCTCG 


ATTATAAAAC 


TATTGATAAT 


GTAATAATAA 


ATAAAATAAA 


600 


5 


AAACCTTATA 


GTCTCGATTT 


GTAGTGTATC 


CCATAAAGTT 


AGATATTATT 


CTAGCTTTAT 


660 




GAGATGAATA 


TCACTTTCAA 


TCAATCCTAT 


AAGGATATTT 


GCTAAGTTTA 


ACTATAGATA 


720 




TAAATTTGTG 


TACTATTCTT 


CGTC AAATAA 


TGACTTTGGT 


GCATCTTCTG 


TTTCTTCAAC 


780 




ATCACCATCA 


GATATACCTA 


ATTTTTCTCT 


CAATTTACGA 


TCAATTTCTT 


CTTTAATTTG 


840 


10 


TGGATTTTCT 


TTCAAGTACA 


TTTTAACATT 


TTCCTTACCT 


NGACCCATTC 


GTTCGCCATN 


900 




GC AAGAATAC 


CAG 










913 



(2) INFORMATION FOR SEQ ID NO: 131: 

75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 0 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



30 



35 



40 



45 



TGANGTAAAT 


TTGGTTGTGG 


C TC ANTCGC A 


TCTGCGAAAG 


NGNAGTGCCC 


NGAGACCGCC 


60 


TGTAATGTTC 


ATGACGACCC 


TAGAATAATA 


ATAAACTAAA 


AGCGCCGCCT 


NATGAAANTG 


120 


AGCGCCTTGG 


ATGNAATCAC 


TCCAAACGGA 


CACCNTCGAA 


NCCACCTNGA 


GATGTANANG 


180 


AAATACATAG 


TAAACCTACG 


AGTGATC CAN 


CGATATAAGG 


GTTCATGTCT 


GATACAGATG 


240 


TGATTGC TAA 


TGTTGGTAAG 


TAGATAACAA 


TTGCAACACG 


CCCTAAATGG 


TAAACGACAA 


300 


ATAATAATGA 


GCCANTGACA 


CGTATGC TAG 


GGCCAAATCT 


AGCTTCTAAA 


TATTCATATG 


360 


CAGATGTTAC 


CTTTAACTTT 


TTAAAGAAAG 


GGACATAGAA 


ATAAATAAGT 


AATGGAATAA 


420 


TTGCGACGAT 


AGCAATGTTA 


CCAGCGATAT 


ATGACCAATC 


TGTTAAAAAT 


GCTTTCCTCT 


480 


GGTGTAGACA 


TAAATGTAAT 


CGCACTTAAC 


GTAGTAGCAT 


AAATTGAAAA 


GCCAACTACC 


540 


CAAGATGGCA 


AGCGACCACT 


TGCGGTAAAG 


AAACTATTGG 


TACTTTGGCT 


CGCGCGCTTG 


600 


GTAAAATAAA 


CGCCAATGAA 


CAACATAGCT 


AGTAGATAAA 


TGATAACGGC 


AACCCAGTTT 


660 


AGTGTGCCAA 


ATCCAACTTC 


TTTCATGGGC 


AACATCCCCT 


TTACAATGTA 


TTGATTCTTT 


720 


GATGTCTATA 


AATCGTATTT 


TGCAATGAGT 


TGATCTACAT 


GTTCGTCGAT 


GTGCTTCGTT 


780 


AAAAGGTTTG 


AAAGGTCTTT 


TCGGTAATCC 


TGCATCAATG 


TCACGATGAC 


GTAATATTTC 


840 


TTTCAATGTT 


GGATAAATCC 


CCATNGATNA 


CAACTGTTTC 


GATAATGTCG 


TTTGAATCAT 


900 


GTTGTAGTTG 


GTAAGCTTCT 


TGAATTTGAC 


CTTGTCGTGC 


TAAGTCGAAG 


ANTTCTCTAG 


960 


CGCTCGTGCC 












970 



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 791 base pairs 
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<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEC; ID NO: 132: 



15 



20 



25 



GAAGACATGA 


ACGCATACGA 


AGATAGTTAT 


GAAAACCGAA 


CTAAAAAAAC 


AACTGCTGGC 


60 


AGTAGCGATT 


TAAGTATGTA 


ATTAACGAGG 


GCATTAGCCC 


TCTATTTTTG 


TGTACAAAAT 


120 


AACGATAAAC 


GAGGTATTTA 


ATATGACTGA 


AACAACTTTT 


AATCCAATTA 


CATCATTAAC 


180 


AATTAACAAT 


GAAGAAGTGA 


AAGCAAAAGC 


AACATTTATG 


TTCGATAAAA 


CCGCTAAAAA 


240 


ATTTGCAACT 


GAACAAGAAG 


ATAACAAAGG 


TAGAAAACAA 


ACGATTTCAG 


GATTTACTAA 


300 


TGTTTATNAC 


GCTTTATTAG 


AACGTGACAC 


AGTGGCAATC 


GTAGACTTTT 


GGGAATGTGC 


360 


AACAGCTTAT 


CTAGGTAAAA 


GCGCGCCTAA 


AAGAGAAGAT 


ATTGAAGCGG 


AAATCATGGA 


420 


AATCATCGAA 


AGAGAAAACG 


ACACGTTGAT 


CTTTTACAAG 


GTGCGTTGGA 


CGTAATGAAT 


480 


AATAGCGGTT 


TTTTCAAGCA 


GAAATCACGT 


CTATTCTGGA 


CACAGATGAA 


CCAAGCGCCA 


540 


TCGTTAGCCA 


AAGAAGACGA 


GAAAGAGGGC 


GCGAAAGCTG 


GTATCGAGAT 


GATGAAGAAC 


600 


AACTACAAAG 


AAATCATGAC 


CGTAGCACCT 


TATTAGACTA 


TTCGGAAATA 


AGGCAGATGA 


660 


CAAGTCGTTA 


CATAGGTTAT 


ATGAGTAATG 


ACGAGCTAAT 


GAGCATGCTA 


CCTGCCGAAT 


720 


GGAATGACTG 


GATTATTGGC 


GCTAGACAAG 


CATTGATTGA 


CCAAAGGGAC 


NTCGCTCGTC 


780 


CCGCTCGTGC 


C 










791 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



50 



55 



GGCACGAGCT 


AAAATATATG 


AAGCTAAAGG 


CCGTCCATCT 


GACAATCCGC 


TTATTGTTCA 


60 


TATACACAGT 


AAAGTTCAAT 


TAAAAGATTT 


TACATATACT 


TTGGCACGAG 


CGTAGAAAAG 


120 


TTAATGCAGG 


CATTCTGGCC 


GGGCCCTATT 


TCGTTTATAT 


TGCCGCTAAA 


GCCAGGCTAT 


180 


CTATGTCGAA 


AAGTTTCTGG 


AGGTTTATCA 


TCAGTTGCTG 


TTAGAATGCC 


AAGCCATTCT 


240 


GTAGGTAGAC 


AATTATTACA 


AATCATAAAT 


GAACCTCTAG 


CTGCTCCAAG 


TGCTAATTTA 


300 


AGTGGTAGAC 


CTTCACCAAC 


AACTTTCAAT 


CATGTATATC 


AAGATTTGAA 


TGCCCGTATA 


360 


GATGGTATTG 


TTCAAGCTGA 


ACAAAGTGAA 


GAAGGATTAG 


AAAGTACGGT 


TTTAGATTGC 


420 


ACATCTTTTC 


CTTATAAAAT 


TGCAAGACCT 


GGTTCTATAA 


CAGCAGCAAT 


GATT AC AG AA 


480 


ATACTTCCGA 


ATAGTATCGC 


CCATGCTGAT 


TATAATGATA 


CTGAACAGGC 


AATTGCACCA 


540 
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GGTATGAAGT ATTAAGCATT ACTCACCCAA TACAACACTT ACAATTATTA CAGATATTGA 600 

GAGCAAAATT GGAAATGACG GTAAAGATTG GTCTTCTATA GCTTTTATTG TGCCGAGTAA 660 

TAAGGTGGCG TTTATACCAA GTGAAGCGCA ATTCATTCAA TTATGTCAGG ATGACAATGA 720 

5 TGTTAAACAA GCAAGTCATA ATCTTTATGA TGGTGTTACA TTCACTTGAT GAAAATGAAA 780 

ATATTTCAGC GGCGTATATA TACGGCTTTG AGCTCGTGCC 82 0 

(2) INFORMATION FOR SEQ ID NO: 134: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



25 



30 



35 



40 



45 



GTGACTTGAA 


AGATATAGTA 


TATCGGATTT 


GTAATGCGCC 


GTTAATCAAT 


TATGATGCGA 


60 


TTAAGATGAC 


CCGACTTAAA 


ATACNCAGTC 


CGACACAAGA 


TTGC T ATGAG 


TTTTTTGGGT 


120 


GG AG AT AT TG 


AATTATATAA 


TTATCACTAT 


TAATACCTAC 


ATCAAAGGTA 


TGCTATTGAT 


180 


TTAGTTATTA 


AAGNAAATCA 


TAAAACATAT 


AAGAATCAAG 


GAAAAGTAAA 


TACTGATTAT 


240 


TTTTGTTTTG 


GAAAAGATAT 


CATTGCGCCA 


GCAAATGGTA 


CAGTTGAAAA 


AGTAGTGAAT 


300 


GGGGTTCAAG 


ACAACAGTAT 


TGGAAGTACG 


AATGAATCAC 


AGTTTTTAGG 


AAATTATATT 


360 


GTAATTAAGC 


ACGCAGAAAA 


TGAGTATAGC 


TTAATAGCTC 


ATTTACAACA 


ATATTCAATC 


420 


ATTGTGAATG 


AGGGGCAAAA 


TGTTAAATAT 


GGTGACTTCC 


TTGGGAAGGT 


TGGGAATTCT 


480 


GGCAATTCTA 


CTGAACCTCA 


TATACATTTC 


CAAGTAATGA 


ATGATAAGAA 


TATTGAAGCA 


540 


TGTACGTCTT 


TGAAAATTCG 


ATTTTTAAAT 


AATCTAGAAC 


TTATCAAAGG 


GGATGTGGTC 


600 


TGCGGATTAC 


AAGGTGAATG 


ATGGCGATAC 


TTATAAAATC 


TCGACACTAT 


AAAAATGGTA 


660 


TAGTGTCGAG 


ATTTTCTTGC 


TTATTTAGTT 


AATTCAAAGT 


GTACGCCGGA 


TTCATTAGAA 


720 


GTTGATGTAT 


TTGTGTTTGT 


AATAGAGTAT 


CCAGTTATTG 


AAATTTTAGA 


TTCAATATCT 


780 


GAAGCGGTAT 


TTGTAGATTC 


GGGATTATAG 


AAGCTACATT 


CATAAGTGTT 


ATCATCTNTG 


840 


GTNTTCNAAG 


AATTAAACAT 


TCCCTTTGGC 


TTTTAACTTC 


CGACTTTAGT 


GGTGGGNTAA 


900 


TGNCAAAAAG 


TNTGGAGNTG 


CTTATTACTA 


TTAATTAACA 


ACCCAGCCCA 


ATTGGGNCAC 


960 


CCCCCAACGG 


ATTTGGACCA 


ATCCTCAACT 


TACCTGCTTA 


TGAATCCCTG 


GCTCCCAACC 


1020 


CCCGCAAATT 


TGCTAAACCC 


AAAATCC AC T 


TCCAANGTTT 


CCAAACTTAC 


TTTCTTCCAA 


1080 


GNTAATTTCT 


TCAAGGATTT 


TTTTTGG 








1107 



(2) INFORMATION FOR SEQ ID NO: 135: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 135: 





rU\\3 ill on 1 v_ 


f* A ppf 1 A fPMT 


A A appp A TP* Pi 


PPTAAPAATA 


aa Mprr: a a mm 




60 


10 


L. 1 1 i. Vj/\ 1 v_ 


A fi r P / TV2T"TVP' 
LnA I 1 vjt 1 1A_\_ 






/*^«pii Mpntfnprn a pj 

\_ 1 X X 1 i- V— 1 AU 




120 






1 n 1 uu i X AA 1 




TPTPTA TPiPPi 


PiT AT A PIP Pi AP* 


PV2A APATA^A 


180 




TGTGTTGGTA 


AACCAGTANC 


NAATTCTTTT 


TCTCTGTCGA 


TG AC TGCATC 


TTTAAACGTT 


240 




GACTTCACGA 


ACCCATTTTG 


AAATAACACA 


TCTGACATTT 


GNGACAATAC 


GGATTCTTTA 


300 


15 


TCAGTTGCCG 


ACNAATTGAG 


CATTATATTT 


TCTTTATGCA 


CCTAATTGCT 


TGTCCCATCC 


360 




ATTTTCCCTC 


GCTTCTTTAT 


TTGAATAATT 


TTTTAAAATC 


TCATTTACAT 


CAGAATTTTT 


420 




GCGACTTTGT 


ATGATGCGCT 


TAATTGCGTC 


ATTGTCTTGC 


GCCACATCTC 


TCAATTGTAG 


480 




TAACGCTCTT 


AAGTGTGTCA 


CTTTATCAAC 


AGCAGCAATA 


GGTACAATAA 


TATGGATTGC 


540 


20 


TGTGCCATCT 


GACATGTATA 


TTGGTTCTTG 


TAATATCAAC 


ATACTCATCG 


CTGTTTTATG 


600 




TACATGCTTT 


TCAGAGTATG 


CATGTGGAAT 


AGCAATATTT 


TGCATGATAA 


CCATATACGA 


660 




ATCANCANAT 


ACCNGGGAAT 


TCCATCTCAT 


GAATATATTT 


AATATCAATA 


AAATGATTAG 


720 




CAACTAACAC 


AGCACGNGCT 


TTAGCAATAG 


CTTCATCAAT 


ATTTTCAACA 


GGAGGCNTNC 


780 


25 


TTTNCACGGG 


CCTGGCCGGT 


ATCAAGGCAG 


CTAAATCTAA 


TGTCTTATTT 


TGTGTGACAA 


840 




TCGATCCATT 


AATGGTTGAA 


ATTGAATTAT 


AATTGGCAAT 


AAAATCTTCT 


AAAC CATC AC 


900 




GTAGTTCTGT 


AATGTCATTA 


ACTGTCGTTG 


TGCGTTCAAT 


TAATGCCATT 


AACTTGTTTA 


960 




TTTCC TTATC 


AATGTCAGCC 


GATTCCTTAT 


TAATGTACTT 


CATCACTTCT 


TTACGTAACT 


1020 


30 


TTCGTTGCTC 


ATTTTCAGAT 


AAAGCTACTT 


TTGTGATAAA 


TAATTTTTTA 


TGTGTTAGGA 


1080 




CAAACATTGG 


TGAAAAGACG 


AAGTCATAAT 


CTAATGTGTA 


ATTTTCAAAT 


GTTCT 


1135 



(2) INFORMATION FOR SEQ ID NO: 136: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136: 



50 



55 



CGTGCGAGCT 


GCTTTTGGTA 


AAAACGGCGG 


TAATATGGGT 


GTATCTGGAT 


CAGTTGCTTA 


60 


TATGTTTGAT 


CATGTGGCAA 


CATTTGGTAT 


TGAAGGAAAG 


TCTGTTGACG 


AAATACTTGA 


120 


AACATTAATG 


GAACCAAGAT 


GTAAGATGNA 


AATGATGTGA 


TTGACGATAA 


TGGATTGACA 


180 


ATAGTCTATG 


CTTGAACCAA 


GATCCAATTT 


CCAGTCGTTC 


CAAGATGCGC 


TTCGTNCACC 


240 


AGGTTGTTGG 


AAGAATTTAA 


AAGTTGCTTG 


AATTTGAAAT 


TGTTACCC TC 


CAAACAAGAT 


300 


ATTGAACTTT 


CTGAAGCGGA 


CCNAAGTAAC 


ATTTGAAAAA 


TTAATCGATG 


CATTAGAAGA 


360 
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5 



10 



TTTAGAAGAT 


GTACAAAACG 


TATTCCATAA 


J. luun i. 1 X vj 


AAATAATGAA ATCAGCAGAA 


420 


CAATGGATTG 


ATGAATTGCA 


ACTTGAATCA 


CATCCTGAAG 


GTGGTTTCTA 


TAGAGAGACA 


480 


ATTCGAGAAG 


TATTGAAAGA 


TGGACGCAGA 


GCGCCGTTTA 


GNAGTATTTA 


TTTTTTACTT 


540 


ACAGATGACA 


ATATTTCGCA 


TTTTCATCGA 


ATTGATGCTG 


ATGAAGTATG 


GTACTATCAT 


600 


GCTGGTGATT 


CTCTAACAAT 


TCATATGATA 


AATCCGGATG 


GGG AATATAC 


GACTGCAACA 


660 


TTGGGT AC TG 


ATATCCAAAA 


TGGAGATGTA 


TTGCAATATG 


TAGTGCCTAA 


AGGAACAATT 


720 


TTTGCTTCTT 


CAATCGAAAT 


TTCAAATACT 


TATAGTTTAG 


TAGGTTGTAT 


GTGTCAACCG 


780 


GCATNTGAGT 


TTAAGCAGTT 


TGAATTGTTT 


AAGCAATCTG 


AATTAATTAC 


ACAATATCCG 


840 


CATCTTAAAT 


CAGTGATTGA 


NAAATATGCT 


TTAAAATAAA 


AATGATCAAT 


GAAGTGGTTT 


900 


GAAGGTTGTT 


AATAAACCTT 


TGAGTCACTT 


CATTTTTATA 


TGTATTCTTG 


ATTGAATCAG 


960 


AATAGATTTG 


ATG 










973 



15 

(2) INFORMATION FOR SEQ ID NO: 137 : 

<i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



35 



40 



45 



SO 



55 



GGCACGAGCT 


GGTTGCATAT 


CTAAACTTTG 


GTAATC TAC A 


CCTTCTTTAG ATTTACCGAA 


60 


TGCGATAGCT 


GGAGACTTAA 


CTGCGGCTAC 


TTTCGCATGT 


GGAATAGCAA 


TACCTCGCCG 


120 


ATACCAGTTG 


TACTTTGTGA 


TTCTCGATTG 


TGAATCGCTT 


CCTTAAATGA 


CGCGACATCA 


180 


CTTAATTTAC 


CTGCCTTGNC 


TAATTGGAAT 


TACCTAACTC 


ATCAATAACA 


CCATTTTTGT 


240 


CATTTGCCAN 


TAAATCCATT 


GCTATCGTAT 


CTTTTGTTAA 


TAACTCTGTT 


ACTCTCATTA 


300 


TTTTCACTCC 


CCATCAAGTA 


CGCTAATCGN 


AACTTGGTGA 


TTCTATTTTT 


TCTTATAGCG 


360 


GTCCCGTGTT 


TGCTAAGTCC 


TCAATCAAAA 


TGCCGGGGCA 


AGTACCGCAA 


TGCGACTGCT 


420 


TGGTTGGGAA 


TGCTTTTTCA 


ATCGTTAAAC 


CTGGAAGCAA 


TTCCAGCCAC 


CATGCCTGCA 


480 


ACTGTACTAT 


CACCAGAGCC 


AACTGTATTA 


ACCACTTTCC 


CTTGTGGATT 


AACTGCTTTA 


540 


ATACTGATTT 


CTTTATCAAT 


ATAAATAGCA 


CCATCACCGC 


CAAGCGAGAC 


AATAACAGAT 


600 


TGCGCACCTT 


TATCAACTAA 


CAAACGACCA 


TATTTAATAA 


CATCTGTGTC 


TGAGTTCACT 


660 


GTTGTATTAA 


ACATCACTTC 


TAATTCATCT 


TTATTAGGTT 


TAATAAATAG 


TGGNTGAAAT 


720 


GGTAAAACGC 


TTTCAAGCCA 


ATTCTTTTTC 


AGCGTCGACC 


GACTAATTTA 


GCACCTGTCT 


780 


GTGCTGTAAT 


TTGTGCAAGT 


TTGCGCATAC 


GCATCGCTTG 


GAATACNACT 


TGGTACACTT 


840 


CCAGCAACAA 


TAACGATATC 


NTCGC TTGTT 


GTATTTTTAA 


TGTGTTGTAA 


CAGTTGTTCA 


900 


AATTGTGTTG 


ACGTTATATG 


AGGACCCGGT 


GCATTGATTT 


CTGTTTCTTG 


TCCTGTTTTT 


960 


AATTTCACAT 


TAATACGTGT 


ATCTTCATCN 


ACNNCAATAA 


AATTCGATTG 


AATTGCNCTG 


1020 


TTANTTAATG 


TATCTATAAT 


GAATTTCCCA 


GGAAATCCAC 


CTGCANATCC 


CAAGGCAGTT 


1080 


GACTCAACAT 


CTAATGTCTT 


TAAGACGGCG 


CGAGACGATT 


AATACCCTTT 


CCCCCCCAGC 


1140 


GAAGTAATAT 


GTTGCTGTTG 


CTCTGTCCAA 


NNCCATCAAG 


GTTNAGAATC 


ATTCGTAAAA 


1200 
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5 



10 



ATGACATAAG 


TCAATTGTAG 


GAG TGNGAGT 


CNCCTGTATA 


AATCATAAAG 


TCCCTCCTAT . 


X260 


AAAG TGAG AC 


TTTTGTTGGT 


ATTCTTTTAN 


CGAGTCTTGA 


GTTAATGCTT 


TTTCAGATGT 


1320 


GATGANTGTC 


GTACTTTCTA 


GCACAGGGAC 


ACGAGCACAA 


TATACTTTAT 


TAAACTTAGA 


1380 


NTGATCCTAT 


AAGNACANAT 


GANTGAGTGG 


CTAATGACAT 


TGCTGTTTGT 


GTAACTAATN 


1440 


CCTCTTGCTC 


ATCGGGAGTA 


GTTAATCCAA 


GTTCAATATC 


TAATCCATTC 


ATCCCGATAA 


1500 


AAGCTTTATC 


GNAACAATAT 


CGTCTTAATA 


TCTCCATAGC 


ACTAGANCCA 


ATCGTAGCNA 


1560 


GTGTATTTTC 


TTTAACTTGA 


CCACCTAGCA 


TAATTGTTTT 


AATAC CTTTT 


GGAAGTAAAG 


1620 


CTTCTACATG 


TGTTAAACCA 


TTGGTTACCA 


CAATGATATC 


TTTCGCTTGA 


ATATATTTAA 


1680 


TTAGCGGCAC 


GAGCTCGTCC 


C 








1701 



(2) INFORMATION FOR SEQ ID NO:138: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

20 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



30 



35 



40 



AS 



CCCGGTGTAA 


ACCGTATGGA 


TGGATGCTAG 


CACCAANTTA 


ATAATCCTCC 


ACAATANCAT 


60 


TAAGGGTATT 


CATGTAGGTG 


NCACAACATG 


GTAAAGGCTC 


TACCAGTTGC 


TTACCTTAGA 


120 


AC AGC TTTAA 


GTTGNAAATG 


GTTATGAAGT 


AGGTACATTT 


ACGTCGCCGT 


TTATTGAAAC 


180 


ATTTAATGAN 


CGAATTAGTC 


TAAATGGTGT 


GCCAATATCA 


AATGACGCTA 


TTGTAGAATT 


240 


AGTATCACGT 


ATTAAACCAG 


TAAGTGAAAT 


GATGGAACGT 


GAAACAGATT 


TAGGTGTTGC 


300 


AACTGAATTC 


GAAATAATCA 


CAGCGATGAT 


GTTTTTATAT 


TTTGGTGAAA 


TACATCCTGT 


360 


TGATTTTGTC 


ATTGTTGAGG 


CTGGATTGGG 


TATAAAGAAC 


GATTCGACAA 


ATGTCTTTAC 


420 


ACCGGTTTTA 


TCAATCTTAA 


CTAGTATCGG 


TCT AGAC CAT 


ACAGATATTT 


TAGGTGGTAC 


480 


TTATCTAGAT 


ATTGCTAGGG 


ATAAAGGCGC 


GATTATAAAG 


CCTAACGTTC 


CAGTGATATA 


540 


TGCTGTTAAA 


AATGAAGATG 


CATTAAAATA 


TGTTCGCTGA 


ACGCGCAATT 


GAACAACATG 


600 


CAAAGCCAAT 


TGAATTAGAT 


AGAGAAATTG 


TTGTTGTATC 


GC AAAAATGA 


TGAATTTACT 


660 


TACCCGNTAT 


TAAAGATTAT 


G AATNAG AAA 


CAATCAATNT 


NAAGCATTGT 


TAGGGTNGAA 


720 


CATC AAG AAA 


CAANATGCTG 


CATTAGCCAT 


AACAANTCTT 


ATTGGANTCA 


AATTGAACAA 


780 


GGATTAATTG 


AATNCAGATT 


TNCAANAAGA 


TGATAGACGG 


GTTTGATCAA 


GTCCGTGGAC 


840 


NTGNCGTTTT 


GGGCAGGT 










858 



{2) INFORMATION FOR SEQ ID NO: 139: 

50 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 497 base pairs 
{BJ TYPE: nucleic acid 

55 

(C) STRANDEDNESS: single 
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ID) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



10 



15 



GAATTCNAAG 


TCACTTTCTT 


TTTTAAGCTT 


TAAATTTCTC 


CCCATTTTTT 


TAGCCCCCTA 


60 


TAAGGATTGA 


ATATCAATGC 


CTTCTNTCAT 


TAAAATTTCT 


CTAATTTGCG 


AAACAAATAA 


120 


TAATGCATGT 


TCTCCATCAC 


C ATGC AC AC A 


AATTGTATCT 


GCTTGTAACG 


TTACTTCCTT 


180 


ATTGTTTTGT 


GAAATAACTT 


TATTTTCCNN 


CACCATCTTT 


AAAACCTGCT 


TAAGTGC TTC 


240 


GTCAGTATCA 


GTAATCACAG 


CATCACTTTC 


TTTTCTGATT 


GATAAAATCA 


TATATGTTCC 


300 


TATAAACACT 


AAAAATCCTA 


TAACTAGGTA 


ATAATATTAA 


ATTCAAGGAT 


CGANCCTCCG 


360 


CTAAGCGACA 


ACAACAATGG 


TGACAATAAC 


AACAACAACT 


GAGAATTNAA 


AT AC CTAATT 


420 


CAAAAAGGGG 


TNATNGGNCT 


TACAATGAAT 


GTGGCCNAAA ATTGGGGAGG 


NTCCAAGGNG 


480 


GGNTCAAATT 


RRDCTNS 










497 



(2) INFORMATION FOR SEQ ID NO: 140: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 







TTGGAAAATG 


TAATTGAATG 


AAACGGGTAA 


AACCTTCCCC 


AAAGC AC C AT 


60 




AATTGTTGCA 


AGTAGGTTGC 


CACCCCAATC 


CTGGTTTACC 


CTGATTTGAA 


AGGCTTTGGG 


120 


40 


CAAGCCGAAA 


TATAGATATC 


TCTAACCGAC 


GAGATTTATA 


ATTTGATGAT 


TTATCAATTA 


180 




GGTGCATTAC 


AAGGGTTTTG 


TCGCATTCAT 


CAACTTAAAA 


TTAATCATGT 


TAAACCCGCA 


240 




TGGTGCATTG 


TATCAGATGG 


GTGCAAAAGA 


CAGAGAAATA 


GCAAACCCCC 


GTNATANCAC 


300 




NACCTGTTNA 


TGNCNTTGAT 


CCATCACTAG 


TGTTAGTAGG 


ATTAGCAATC 


ATATC TAATT 


360 


45 


TCAGAAGCAA 


AGAATGTCGG 


ATATAATCCA 


GCTTCTGAAG 


TGTTTGCTGA 


TAGACGATAC 


420 




GAAGATGATG 


GGCAGCTCGT 


TAGTAGAAAA 


GAAAGTGATG 


CTGTGATTAC 


TGATACTGAC 


480 




GAAGCACTTA 


AGCAGGTTTT 


AAAGATGGTG 


NAGGAAAATA 


AAGTTATTTC 


AAAAAACAAT 


540 




AAGGAAGTAA 


CGTTACAAGC 


AGATACAATT 


TGTGTGCATG 


GTGATGGAGA 


ACATGCATTA 


600 


50 


TTATTTGTTT 


CGCAAATTAG 


AGAAATTTTA 


ATGAAAGAAG 


GCATTGATAT 


TCAATCCTTA 


660 




TAGGGGGCTA 


AAAAAATGGG 


GAGAAATTTA 


AAGCTTAAAA 


AAGAAAGTGA 


CTTTGAATTC 


720 




ACAAAAAATC 


ATAAAAGGTT 


ATT ATT AGG T 


TCTGTATTTC 


ACGATGGCAA 


CTTCTGCAAT 


780 




TGGCCCAGCA 


TTTTTAACGC 


AAACAGNAGT 


ATCAACATCA 


CAATCGTTTG 


NAAGTCNCGN 


840 


55 


ATCTGCCATA 


TNAC TGTCTA 


TCATCATTGA 


CATTGGTGCA 


CAAATTAATA 


TATGGCGCAT 


900 




ATNAGTTGTA 


ACTGGTTTAA 


GAGGTCAAGA 


AATATCAAAT 


AAAGTTGTTC 


CTGGGCTTGG 


960 
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TCTCGTGCC 969 
(2) INFORMATION FOR SEQ ID NO: 141: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

10 <C) STRANDEDNESS: single 

(D> topology: linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 





GGNACGAGCG 


GCACGAGCGC 


ACTTATTTTA 


TCAGAACATT 


CTAAAAAGAA 


TATATTCAAT 


60 


20 


ATAAAAAAGG 


CTGTGTAGGG 


GTGTCTTATG 


AATAATAGAA 


ATGTGTATGA 


TATCGAAGTA 


120 




AGTGATTATA 


AAGGCTTAAC 


TTATAAATTA 


GAAGCATTTA 


GAGGTAAAGT 


GATTTTAGTT 


180 




GTTAATACTG 


CAACAGAATG 


TATATATAGC 


GAACAATTGA 


AAAAACTAGA 


GACTTTGTNT 


240 




CAAAAATATA 


AGGATCGTGG 


GTTTGTAGTG 


TTGAGTTCCC 


CGAATAATAA 


TTGTGACAAT 


300 


25 


CGACAACCAG 


GNTCTAATGA 


AGAAATCTTG 


AAAATTAGNC 


GGNGAGAAAT 


TTGGGTGTTA 


360 




CATTTCCAGT 


GCTAGCTTAA 


AATATCTTGT 


GAACGGGAAC 


GAATGAACAT 


C CCCGNT ATT 


420 




TACGCATTTN 


ANAGGAATGA 


ACAAACCAGG 


AATCTTNGGG 


CCCCCAATAA 


AATGGAATTT 


480 


30 


CACAAAAATT 


TATAATCGAT 


CGACAAGGCA 


ATGTTGTAAA 


TCGATTCTTG 


CCAATGGAAG 


540 


ATCCAATGGA 


TATATCGACA 


AATATAGAAA 


TATTATTGGA 


AGAATCTTCA 


ATCTTAAATT 


600 




TAANATTGAG 


CGCTTAGTNT 


GCAAATACAC 


AA 






632 



(2) INFORMATION FOR SEQ ID NO: 142: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

40 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



so 


CCCCACNAAG 


TACNCGAAAC 


GCAAACAAAC 


ATCTTAAAAG 


GAGGAACGAA 


CAATGCAAGC 


60 


ATTACAAACA 


TTTAATTGGA 


AAGAGCTACC 


AGTAAGAACA 


GTGGAAATTG 


AAAACGAACC 


120 




TTATTTTGTA 


GGAAAAGATA 


TTGCTGAAAT 


TTTAGGATAT 


GCAAGGGCAG 


ACAATGCCAT 


180 




CAGAAATCAT 


GTTGATAGCG 


AGGACAAGCT 


GACGCACCAA 


TTTAGCGACA 


GCAGGTCAAA 


240 


55 


ACAGAAATGT 


AACGATCAAT 


CAACGAATCA 


GGATTATACA 


GTTTAATCTT 


TTCTAGCAAA 


300 




TTAGAAAATG 


CGAAGCGGTT 


CAAACGTTGG 


GTAACTTCGG 


AAGTTTTACC 


AACATTAAGA 


360 
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AAAACTGGTG CTTACCAAGT ACCTAGCGAC CCAATGCAAG CATTGAGATT AATGTTTGAA 42 0 

GCTACAGAAG AAACAAAACA AGAAATTAAA AACGTAAAAG ATGATGTTGA TNGATTTGAA 480 

AGAAAATCAA AAACTGGATG CGGGAGACTA CAATTTCTTA ACTAGAACTA TCAATCAAAG 54 0 

AGTAGCTCAC ATNCAAAGAC TACATGCGAT AACAAATCAA AAACAACGTA GCGAATTATT 600 

CNGGGATATT AATTCAGAAG TGAAAAAGAT GACTGGTGCA AGTTCAAGAA CGAATGTTAG 660 

AC 6 62 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



30 



35 



40 



TAATTCGCAA 


TAGGAGTGAT 


GAATATCATA 


AATTTTACCC 


TCCAAATGAA 


GCTAATGAAG 


60 


TCCTGGACCC 


GAGTAAGACG 


CATGTAGCCA 


AGCTAAAATA 


ATCCACTCTA 


CCTTATCTTT 


120 


AGTTAATAAT 


GTTACTAAAT 


GTTGTTCATA 


CGCTGCTTTT 


GAATCAAATT 


GTTTTGGTTC 


180 


ATTAATATAA 


ACAGGAATAT 


CGTGCTTGTT 


TGCTCTATCT 


ATACAAAACG 


CATTTTGATG 


240 


ATCCGTATAT 


AGCNCCGTAA 


CTTCAATATT 


TTCAAGTTTT 


CCTGATTCAA 


CATGCTCAAC 


300 


TATATTTTCA 


AAGTTACTTC 


CTGAACCTGA 


TGCAAAAATC 


GCAATTTTAA 


CCATTGTTAT 


360 


ACCCCCAACA 


ATTCAATTGC 


AGTTGACTCA 


TTTTTCACAA 


TATGACCAAT 


TTGATAAGCT 


420 


TCCACATTTT 


GTTCTGCTAA 


AATC TTC AAA 


GCGCGTCGAT 


GCATCTTTTT 


CATCAACGAT 


480 


AACCGTATAG 


CCAATACCCA 


TGTTAAAAAT 


GTTATACATT 


TCATTTGTGT 


CTATATTGCC 


540 


TTGTTGTTGT 


AACCAATCAA 


ATATTTTTGG 


CGTTGGAAAT 


GATGTAGTAT 


CAATTCTAGC 


600 


AGCATATCCG 


GCTGGCAATG 


CACGTGGAAT 


ATTTTCATAA 


AAACCTCCAC 


CAGTAATATG 


660 


ATTCATTGCC 


TTAATAGAAA 


CTTCTTTTTT 


TAAAGCAAGT 


ACAGGTNTGA 


CATATAATTT 


720 


AGTTGGCTCT 


AAAAAGACAT 


CTATAAATGG 


ACGATTATCG 


NAGGGTGATG 


CCAAATCAAT 


780 


GNCTGATTCA 


NTAATTAATN 


TGCGCACTAA 


ACTGTNTC C A 


TTNGANTGAA 


TGNCACTTGG 


840 


ACGCAAGTCC 


TATAACAACT 


TGGCCCTCTT 


NCAATTCTTG 


AACCATCTTA 


CAATAGNCAA 


900 


CCTTTTTCAA 


CTGCTCCAAC 


AGCAAATCCG 


GCTACATCAT 


ATTCACCTTC 


GTGATACATT 


960 



(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



125 
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(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

5 



10 



15 



20 



25 



TAATNAATTT 


GGCGANTCAC 


TTGTCGCTAA 


GTGGCTCCAC 


CTTGTCATCT 


ATTATTGCAT 


60 


CACTCATGGG 


NCGNACATCA 


TAGTGTTGAT 


TTGATTCAGC 


CATATCNACG 


TTTTGATTTT 


120 


TCTAATAGAA 


GATCAGCAAC 


AACATCAACA 


TTTGAATGAT 


TCATATATGA 


TGCAGGTACG 


180 


TCTTTTAATG 


TTTTAATGTT 


ATCAATATAA 


AGATTGATGT 


AGTGTTGCGG 


GATATTGTAG 


240 


TGATGTTCAA 


GTAACATATC 


AGTAACAAGT 


TGATTAAAGA 


CACTTTCATC 


TAATTCACCA 


300 


CGTGCCACAG 


CGCTTTCTAT 


TAATGCTTTA 


TTTGGGAAAA 


TAGGCACGAG 


CGAACGTCAC 


360 


GTAACCATTT 


NGCGACATCT 


TCAAACGTAT 


CCGCTTCTAA 


TCCTTCCCAG 


GGGTTACGTG 


420 


CTGCAAAAAT 


CGAAATCGGT 


GATAATGGTG 


TAATAACACG 


TTTCGCATTT 


TCAATGACTG 


480 


AATTGATATT 


TAAC TGTGTT 


GTCATACCTT 


TCACCTCCTA 


TAAATACTTC 


TTCAAATAAT 


540 


TCGGATGACT 


TTCTATCGCT 


TTCGAGCGTG 


C TTC ACC TAG 


ATT AAC TAAC 


CACACGTACA 


600 


ATACCGCAAA 


ACCCTTAGAG 


TATCGATGAC 


GCGCCACCCA 


AATACTTAAT 


AAACTGCCAA 


660 


AGATTAAAAT 


GACAACACTA 


ATGATGACAC 


TCACTGTAGG 


CGGTGTTGTC 


GCATGTGTTG 


720 


TTATATTTTG 


GTAATACATC 


GTAAAAATAA 


TTGTGTGTGA 


TGACGTAGAT 


AAATGTCACA 


780 


ATTGCAATCA 


AAATCATGCC 


AACCAGACGT 


GCCATGCGCC 


CTTTACTAAA 


GGCTACCATC 


840 


TGATTCCAAG 


ATACAAGTTA 


ATGACCATGC 


TAGAATGAGT 


GCACTTAACA 


CTTCATATGC 


900 


ACTTCTGTCA 


CTACCATCCA 


AATAGAATGC 


ACGATAATAG 


CTAATACACG 


TCCATGACAT 


960 


CCAGCATAAG 


CTCTTACAAT 


GCTGTTTTGA 


ATATGATCGC 


TCCCAARRDC 


TNS 


1013 



(2) INFORMATION FOR SEQ ID NO: 145: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



45 



50 



55 



ATCCACCCAT 


NACGNTACTC 


ATANCATTGT 


CAACAGTAAG 


TAATGTCCAT 


ATGCATATCN 


60 


ATNCCGCTCG 


GCANTATANC 


ACCCAACCGG 


ATTTCATAAG 


TGGGCGGATA 


AAGTGATNAG 


120 


AGTGATCTCA 


ACACTATCGA 


NTAGNTACCA 


GCGTTCTTTA 


TCGGTNTTAT 


TTTATNATTT 


180 


ATTGTCACAN 


ATTTANTGAA 


TATAGATAGC 


GTT AT AC TAA 


GTCAGNTTAT 


ATT AC CTGTA 


240 


ATCACGCTAT 


CTTTAGGTAT 


GTGTGCATAC 


ATCATTCGTT 


TAGTGCGTTC 


TAATTTATTG 


300 


ATGTTATTGC 


AAAGTAATAT 


CGTACAANCA 


GCAAGATTAC 


GCGGTATGAA 


TGANCGTTAT 


360 


ATTTTAATTC 


ATGATTT AC T 


AAAAC C AAC A 


ATTTTGCCGA 


TTATCCCATT 


ACTAGGGATT 


420 


TCACTTGGCA 


GTCTAATAGG 


TGGTACTGTA 


GTGATTGAAA 


ATTTATTTGA 


TAATACCTGG 


480 


TATTGGTTAT 


CTATTAATGG 


GATAGTATTA 


AATC TCGAGA 


TTATCCTGTT 


ATTCANGGAT 


540 


GCGTGTTATT 


TAT TGGCTTC 


TTCGTTGTTA 


TTATCAATAC 


GATTGC TG AT 


TTATTAACGT 


600 
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TATTACTTGA TCCGAAGCAG CGTTTACAAT TAGGAAATCC CACAAAACAC AACCAATACA 660 

CCATTGATAT CAGAAAGTAG TGNCCGTCAT GCATAAAATA TTTTCANAGA ATAACCTGAT 720 

ATTTTTTGTA TTCGTTGCAT TTATTTTTGT GGTAATTGTA CTGCAATTCT TTGTCAGTAG 7 80 

TGAAAATGCA ACCACAGTCA ATTTATCACA AACTTTTGAA CCGATTAGTT GGTTGCATTT 840 

ATTAGGAACT GATGATTATG GGAGAGATTT ATTTACCCGA ATTATTATCG GTGCACGTTC 900 

AACATTGTTT GTTACTGTTT TAACATTAAT AGCTATCGTT GTCATAGGTG TTACACTAGG 960 

TCTATTTGCC GGATATAAAA AAGGG TGGAT TGAACGATTA GTGTTAAGGT TTATTGATGT 1020 

TGGTCTAAGT AG 1032 



(2) INFORMATION FOR SEQ ID NO: 146: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



30 



35 



40 



GGCACGAGAC 


TTTGGTGGTA 


GTTTTGAAAG 


ATTACAAGCA 


TTGACAACAA 


AAACAACATT 


60 


ACCCGTATTA 


TCCAAAGACT 


TTATAATAAG 


ACCCCCCCCA 


ANTTGATGTT 


CCTAAACCNA 


120 


GCTGGTGCAT 


CTATGATTTT 


ATTGATCGTT 


AACATCTTAT 


CGGATANACA 


ATTGAAAGAT 


180 


TNAATATACC 


CTACGCAATA 


TCCCAAAATC 


NAGAAGTGTT 


AATTGAAGTA 


CATGATCCCC 


240 


ATGAATTAGA 


ACGTGCCNAT 


ANGGTTAATG 


CTAAATTGAT 


TGGTGTAAAT 


AACAGGGACT 


300 


TAAAACGATT 


TGTCACAAAT 


GTGGAACATN 


CAAATACTAT 


TTTAGAAAAT 


AAAAAACCAA 


360 


ATCATCATTA 


TATTTCTGAA 


AGTGGTATTC 


ACGATGCATC 


TGATGTAAGA 


AAAATCTTGC 


420 


ATAGTGGTAT 


CGATGGCTTA 


CTAATAGGTG 


AGGCGCTTAT 


GCGTTGTGAC 


AATCTATCTG 


480 


AATTTTTACG 


ACAACTGAAA 


ATNCNAAAGG 


TGAAGTCATG 


ATGAAATTGA 


AATTTTGTGG 


540 


CTTTACATCA 


ATAAAGGATG 


TTACAGCGGC 


CAGTCAATTA 


CCTATTGATT 


CGATAGGTTT 


600 


CATCCATTAT 


GAAAAAAGTA 


AAAGGCATCA 


AACAATACCC 


AAATAA 




646 



(2) INFORMATION FOR SEQ ID NO: 147: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 147: 
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GGCACGAGCG 


TC AGTC C AAA 


TCACGCCTTG 


TGGTTTCTCT 


GGAATTGTCA 


TCTATTACTC 


60 


5 


ACCCCCAAAT 


ACATCATTAA 


TGTTAATATT 


GTGAATTGCT 


TCAATTGGAT 


TTATTGTTTC 


120 




ATCTACAGNN 


CGATATCGCT 


TACTATCAAT 


CATGNCATCT 


ACATGACATA 


C CGATTG AT A 


180 




ACTACAAAAA 


GCACATGGCA 


ATTTGTGTGT 


GTNCTTTAAT 


GGTGCAAC TN 


CAGTATGTCC 


Z 40 




ATCCATAATA 


TNTGAAGCTG 


TNTCTATAAA 


ATTCTCTNTG 


TTATGCTGAA 


TGAATTTATA 


3 00 


10 


AATTGNTGGT 


TCATCTGACA 


CTTGGCTGGC 


TCGTTTACTC 


AAAGAGACAT 


C TTTT ATTC A 


3 60 


ANCCAACTGG 


GTACAAATAT 


CTGAAGGTGA 


ANTTAGGGTT 


CTTNAACGAA 


TTTC CAATGC 


420 




ATCAAATAGC 


AGGGGGGTCT 


TGCATTCAAC 


TAAACCCTCA 


GGTTAANCTT 


NNNAATTAAT 


480 




CTTGNTCTTG 


GNTNTCTTTC 


ANCAATATCA 


AGACCATGAT 


TTAATTTACT 


CTTGGGCCAA 


540 


15 


GTCATGGAGT 


TTTATATCCA 


CCGGG 








565 




(2) INFORMATION FOR SEQ 


ID NO: 148: 








20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 919 base pairs 










(B) 


TYPE: nucleic acid 












(CJ 


STRANDEDNESS : single 










25 


(D) 


TOPOLOGY : 


linear 












(ii) 


MOLECULE TYPE: Genomic DNA 








30 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 148: 








CTTGAAGTAG 


TAGGGTTACA 


AGGCTCAACT 


TACCTTTTAA 


AAGGACCAAA 




60 




GTAAAGTTAA 


ACCAATCAGA 


AATGAACGAT 


GATGATGAAT 


TAGAAGTAGG 


1 \ X A X 


120 


35 


AGTTTCTTCA 


TTTATCCAAA 


CCGTTCAGGT 


GAATTATTTG 


GAACTCAAAA 


X A X V_ 1 UA J. 


180 




ATTACGAAAG 


ATAAATATGA 


TTTTGGTAAA 


GTACTTAAAA 


CGGATCGCGA 




240 




ATAGATGTTG 


GGTTTACCCC 


GNGAAGTGTT 


AGTACCATGG 


GAAGATTTAC 




300 




ATCACTATGG 


CCACAACCTG 


GTGGATCATT 


TGNTAGTTAC 


ATTACGAATT 


GACCGTGAGA 




40 


ATCATATGTA 


TGGACGTTTA 


GCGAGTGAAT 


CTGTTGTAGA 


AAATATGTTT 


ACACCTGTAC 


420 




ACGATGATAA 


TTTAAAAAAC 


GAAGTCATTG 


AAGCCAAACC 


TTGGCNCGAG 


CGTATTACGA 


480 




ATTGGTAGCT 


TCTTATTAAG 


CGAATCAGGT 


TACAAAATTT 


TCGTACATGA 


ATCAGAACGT 


540 




AAAGCTGAAC 


CAAGATTAGG 


TGAATCTGTT 


CAAGTTAGAA 


TTATC GGGCA 


TAATGATAAA 


600 


45 


GGTGAGTTAA 


ATGGTTCATT 


TTTACCACTT 


GCACATGAAC 


GNTTTAGACG 


ATGACGGCCA 


660 




AGTCATCTGT 


GATTTACTAG 


GTGAATATGA 


TGGGGAATTA 


CCATTCTGGG 


ACAAAATCAA 


720 




GCCCTGAAGC 


GATTAAAGAA 


GTATTCAAAT 


ATGAGTAAAG 


GTTCATTCAA 


AACGTGCAAA 


780 




TCGGNCACTT 


ATATTAAACA 


GAAGGATTAT 


TAATATTAGG 


AAACAGGGTA 


AAAATCACTT 


840 


50 


TTAACTTAAA 


AAAGGGTTGG 


CGGGTCGNNT 


TTGGNCTCAA 


AAAGAATTAA 


TCATTNTNAC 


900 




AACGNNNTCG 


GNGGATGCG 










919 



(2) INFORMATION FOR SEQ ID NO: 149: 

55 

(i) SEQUENCE CHARACTERISTICS : 
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IA) LENGTH: 955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



75 



20 



25 



30 



(xi ) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


149: 






CTGCATCTGG 


TGAAGGAAAT 


CAANTANNTN 


TTATNNCTGA 


AGTGATTGCA 


TGGTGCAACN 


60 


TCAAATATCA 


CATATGGCTC 


AGTGGANTAT 


ATGGATAAAG 


GCTTAACAGG 


TCATATCATG 


120 


CGGCGTGGTA 


TTACTGAAGC 


GGATGCCTCA 


ATTAATTGGG 


CACTAGGTTT 


AATGAATGAG 


180 


GGTAGC C AAA 


TTATTGATAA 


TACAACAAAT 


TTATGTGGTG 


ATCGCACAAC 


AAGNNCACTT 


240 


AAATCAGGAG 


GTGNAGGTAC 


AGGAGAACAA 


AAAATTAATC 


TAACATCTAA 


ANTCGCACAA 


300 


ATATGGTAAA 


GAAACAGATG 


GTTATNTCCT 


TAAACATGGT 


GTTATGAAAG 


AACATGCATC 


360 


ATCTTGTATT 


TTAATGGGTA 


TCCGGC TACA 


TTANGCATGG 


GGGGAACTAA 


ATCAAGTGCT 


420 


AATCAGGAAT 


CACGTGTTAT 


TNATGTTATC 


TTGAACATGC 


TCGGGGGNGA 


CGCGAATCCT 


480 


ATTTTATTTA 


ATTGANGAAG 


ATGATGTACA 


AGC TGGTC AT 


GCTGCATCAG 


TAGGCCGTGT 


540 


TGGATCCCAG 


ATCAACTTTT 


ACTATTTAAT 


GAGTCGTGGT 


ATTTCTCAAA 


GAGAAGCGGA 


600 


ACGTCTTGTT 


ATACAGGGTT 


TCTTAGATCC 


AGTAGTACGT 


GAATTACCTA 


TCGAAGACGT 


660 


TAAACGTCAA 


TTGAGAGAAG 


TAATTGAACG 


CAAAGTTTCT 


AAATAATATT 


TTGAAAATAA 


720 


AAGTTTGTAA 


TAGATATAGA 


CTGTCGATAT 


TGGTATAAGA 


CTAATACAAC 


GTCAGTATTT 


780 


AAATGATTAG 


GATTTTTATT 


TAAGAAAGGT 


CGTGAATGAA 


GTGGCCGAAC 


ACTCATTTGA 


840 


CGTTNATGAA 


GTAATCAAGG 


ATTTTCCGAT 


ATTAGATCAA 


AAAGTCCATG 


GCAAACGTTT 


900 


AGCATATCTT 


GATTCAACAG 


CGACAAGTCC 


AACGCCTATG 


CCAGTGTTAA 


ATGTT 


955 



<2) INFORMATION FOR SEQ ID NO: 150: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

40 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



SO 


NAAATTTC CT 


ATACGTTATA 


CTTTAATTGT 


TAATAAGCCA 


CATAAATAAG 


AGGGGGAAAT 


60 




GCTGTGTACA 


AGCAAGGTGA 


ACCAAATTTA 


TGGACTGGAA 


GGTTAGATAG 


TGAAACAGAC 


120 




CCGAAAAAAT 


TTAGACATTT 


TCAAACAGTA 


ACATTTGAAG 


ATTTGTC TAA 


GCTGGAAAAG 


180 




AGTAGTATGC 


CATCAGGGGT 


CGGTATATTA 


GGCTATGCTG 


TTGGACAAAG 


GTGTTGCTTT 


240 


55 


ANACAAGGGG 


CGCATTGGTG 


CAAAAGAAGG 


ACCAGATGCG 


ATTAAACAAG 


CATTTGCAGG 


300 




TTTGGCGGAT 


TTGGATCACT 


GTGAAACTNT 


AG TCGATTAC 


GGAAATGTTT 


NTCATGATCA 


360 



129 



EP 0 841 394 A2 



TGAGGAATXA NCTNGATACN CAACCANGAA TTTGGTACTG TNTTGGCAAA TTGAGCTCTN . 420 
TCTTTAGAGC TTGGNNCATT GATAGGTTCT CTC GCAGGTG GT 4 62 

5 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 
1Q (A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

75 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



20 





GGCACGAGCG 


TAAGATTAAC 


CATTGACCAT 


AATATATATT 


GTGTTTTTCC 


AAAATCGGCT 


60 




CTGCTAATTT 


TAAATAGGGG 


CGATATATTG 


TTATAAAACT 


ATTGAAAAAT 


TCTTGTGATA 


120 




GCATAGTGAC 


ATCTCCTAAG 


ACAAAATAGT 


TAGCTTAGCT 


AACCTTTTTA 


CAACAATAGT 


180 


25 


AATTATAAAA 


CGGGAGCAAT 


TAGAAATCAA 


TATATAATTA 


TTAAGAGCAA 


AAATAATTAT 


240 




AC TTTGTT AA 


AATAAGCGTA 


ATTACATGTA 


AATAGGGGGA 


TACTAATGAT 


ATTGAAATGT 


300 




GCATCACATC 


ACTCATTATA 


TAGATCAGTT 


AGATCGGTTT 


AGTTCTCCAG 


GAGATGTTAT 


360 




AAAATNACAT 


TCAGGTGGGT 


ATCATCATAA 


ATATGGAACA 


TTCAATAAAT 


TAGGTTATAT 


420 


30 


CAATGAAAAT 


TATATTGAGC 


TACTGGATGT 


AGAAAATAAT 


GAAAAGTTGA 


AAAAGATGGC 


480 




AAANACGATA 


GAAGGTGGAG 


TCGC TTGCGC 


TACTCAAATT 


GCACAAGAGA 


AGTATGAGCA 


540 




AGGCTTTAAA 


AATATGTGTG 


TGCGNACAAA 


TGATATAGAG 


GCAGTTAAAA 


ATAATCTACA 


600 




ANGTGAGCAG 


GTTGANGTAG 


TAGCGCCGAC 


TCAAATGGAA 


AGAGATACAC 


ATAAAGATGG 


660 


35 


TAAGGTAAAG 


TGGCAATTGC 


TTATATTATG 


AATCAGGATG 


ATGATGAAAT 


TACGCACCAT 


720 




TTTTTATTCA 


ATGGGAGAAA 


GTGCCTCCAT 


GC 






752. 



(2) INFORMATION FOR SEQ ID NO: 152: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : single 

{D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CAAATGTTTA TCATGATATG ATGAATATAA TAATCGGGTA TATAACTGTA TGATTAATTA 60 
55 CACAATAAAG AGAGGGAATA TAATATGNGN NAAGTGTCAA TTAAAGATGT TGCTAGAGAA 120 

GCTGGTGTAT CAGTTNCANC TGTGTCACAT ATTTTAAATC ATAATGATAG TCGTTTTTCC 180 
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5 



10 



GC ANCACCGA 


TAAAAAACGT 


ACATGCTGTT 


CCAGAACGTT 


TAGGCTATGC 


CCCTAATAAA 


240 


CATGCAAAAC 


AAGC TCGTGC 


GCGGCAGTAA 


AATTCAAACT 


ATTGGCGTCA 


TTTT AC CT AG 


300 


CTTAACAAAT 


CCGTTTTTCT 


CAGCACTGAT 


GCAAAGTATT 


CATGACCATA 


AACCATCTGA 


360 


TGTTGATTTA 


TGCTTTTTAA 


CATCTACAGC 


AACTGATNTG 


TATGACAATA 


TTAAACATTT 


420 


AATTGATCGA 


GGTATTGACG 


GATTAATTAT 


CGCACAATAC 


ATATCATCCC 


CGGACGCCCT 


480 


AAATAACTAT 


CTAAAGAAAC 


ATCATGTACC 


TTATGTCGTA 


CTGGATCAAA 


AATGACCATC 


540 


AAAGGCTATA 


CAGATTGTGN 


TCCGGACAAA 


ATTGAATATC 


AAGGGTGGGA 


CAACTTTGGC 


600 


AGCACAACAA 


TTTAATAGAA 


CTCGGGTCAC 


AAACCATATT 


GATAATTGGT 


TGCAACCATA 


660 


TTGACAATGA 


TGGNGAATAT 


TGTCGACTCC 


TGTCGCTGGA 


TTTGTCGATA 


CTTTGCGCGC 


720 


GAATCAATTG 


CCAGAACCAC 


AAATCGTCCC 


TACTGAATTA 


TCTAAACGCC 


GTGGCTAACC 


780 


ATGTTGAATG 


A 










791 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1314 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 





TTTGTCTTTG 


GAATGGGCTC 




ACCTAATAAG 


AAATGATAGG 


GCATTTGAGA 


60 




TTGGAAGGNC 


ATTTTGGCTT 


TGTGCAAATA 


ATACAATAAG 


CTAAATGTCT 


NTTTTTGTTT 


120 


35 


TTGTGAAAAT 


ATGATGGATG 


GCTTGTGTGG 


GCAAGTTTGC 


TAATTTAATA 


AGATATGCAT 


180 


TTTTCAATTT 


AGGAGTTGGC 


CATGCATCTA 


CACTTTATAA 


TGGTGAGAGC 


GTGGTGAGGT 


240 




ATTGTTAATC 


ACGCAATTGT 


AGCGAGGAGT 


TATTGCTACA 


TATGTCGTTA 


TGGCCTATTG 


300 




ATTTTCTAAA 


ATAGCTGTAT 


CAGATCATGT 


GACNAAATAA 


AAATAATTTG 


TTGAAAGCCT 


360 


40 


TTACATAACT 


TGTCTAGACA 


AGTTATACTC 


GTTTTAAGAC 


ATTAAGGGAG 


TGAAATATAT 


420 




GGCTGTAAAA 


AGAGAAGATG 


TAAAAGCCAT 


CGTAAGNCGC 


TATTGGGGGA 


NAAGAAAATC 


480 




NTGAAGCTGC 


ANCGCATTGT 


GTAACNCGAT 


TACGTTTNGT 


GCTTAANGAT 


GAAANCANAG 


540 




TTGATAAAGA 


CGCATTAAGG 


AACAACGCGT 


TGGTCAAGGG 


GCAGTTCAAA 


GCAGACCATC 


600 


45 


AATATCAAAT 


TGTCATTGGT 


CCAGGANCAG 


TCGATGAAGT 


GTATAAGCAG 


TTTATTGATG 


660 




AAACAGGTGC 


TCAAGAAGCT 


TCGAAAGATG 


AAGCGAAACA 


AGCAGCTGCG 


AAAAAAGGGA 


720 




ATCCAGTACA 


ACGTTTGATC 


AAATTGTTAG 


GGGAGATTTT 


TATACCAATA 


TTACCTGCGA 


780 




TTGTGACAAC 


TGGTTTGTTA 


ATGGGGATTC 


AATAATTTAC 


TTACAATGAA 


AGGTTTATTT 


840 


50 


GGTCCCAAAA 


GCACTTATTG 


AGATGTATCC 


CGCAAATTGC 


TGATATTTCA 


AACATCCATT 


900 




AATGTGATTG 


CGAGTTACCG 


CATTTATTTC 


CTTACCANCA 


TTAATTGGTT 


GGAGTAATAT 


960 




GCTGTGTATT 


TGGTGGTAGT 


CCGATCCTAG 


GCATAGTCTT 


AGGTTTGATT 


TTAATGCATC 


1020 




CGCAATTAGT 


ATCTCAGTAT 


GATTTGGCAA 


AAGGGAATAT 


TCCGACGTGG 


AACTTATTTG 


1080 


55 


GCTTAGAGAT 


TAAGCAGTTG 


AATTACCAAG 


GTCAAGTGTT 


GCCTGTTTTA 


ATTGCAGCTT 


1140 




ATGTTCTAGC 


TAAAATTTGA 


AAAAGGATTA 


AATAAAGTCG 


TTCACGATTC 


GATAAAAATG 


1200 



131 



EP 0 841 394 A2 



10 



15 



25 



35 



45 



50 



TTGG TCGTTG GACCGTAACG CTTTTAGTTA CTGGATTTTT AGCATTTATT ATCATTGGAC 12 60 
CAGTTGCATT ATTGATTGGT ACAGGTATTA CTTCTGGTGT TACATTTATA TTCC 1314 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ATAATTATTG NTGGAAAATA ACATTGAGCC CAATTCCCAA CAGTCGCCGA CATTTTCCAT 60 

CCATTTACCC CCCCGTGACT TTGTATCCGA ATTTCNACCN TCCACCNAAG TTGNCATCAA 120 

TATGTTAATA ACCCAAATCC CAAATTGGAA ATATTTTGCC AAGCCAAGTT TTGATGATGC 180 

GCATTTGTCC TAATATTTTC CACTTAACAA GATCCGACCC CCAGCCAAGA AAATACATCA 240 

ACTGAGAAAT ATAATGNC C A AGAC TGGTTT CCAACACACA TTCGACATAA TGAGTTGTCT 300 

AAATTGACTG AGCAACAACT TGTGATTCAG TTGCTTATGC ATATGATTCA TTATGGCACG 360 

AGCGTACATA TCATTCGAAC CCAAAGTATC TTAAATGATG ATAAAGTGAA TCAAGTATGC 420 

GACTATATCG AGTTACATTT TCATGAAGAT TTAAGCCTTT CAGAATTAAG CGAATACGTT 480 

GGGTGGTCAG AGAGCCATCT GTCTAAAAAG TTTACAGAAT CGCTAGGTGT AGGATTCCAA 540 

CATTTCTTAA ATACGACGCG AATTGAGCAT GCGAAACTCG ATTTAACATA CACAGATGAA 600 

ACGATTACTG ATATTGCATT GCAAAATGGC TTTTCAAGTG CAGCGAGCTT TGCGAGAACA 660 

TTTAAACACT TTACGCATCA AACGCCTAAA CAATATCGAG GTGATCGTCC AGCAATCACT 720 

GAAAATCAAC AATCGGCACA ACATAATTAT CACGACCGTG AATTGATATT AC TTTT AAAT 780 

GACTACATTG AAGAAATGAA TCAATTCAAT TGAAGATATT GAAAAGNTGA ACTTATAAAG 840 

AGATTGCCTT NAAACCAACT AATCAACAAC TAAATCCAAT TATAATCCAT ATTATTCAAG 900 

TTGGGCTATT TGAGGAATTT GCTCCAATAC ACAGTATCAA TCCACAGTTG CTTACATGTT 960 

CATCCCATGA TT g?2 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
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5 



10 



AAACCATTTT 


CCAAAACCAA 


GCTGGGCATC 


CAAGTTTTCC 


TATTGTAATC 


TGCAGGTAAC 


60 


CAAGGGAATG 


CTTCCGTGAA 


TTTAGGTGGT 


AGCGTAACAT 


CTATTCAACC 


ATTACGTATT 


120 


AATTTAACAA 


GTAATGAGAA 


TTTTACAGAT 


AAAGATTGGC 


AAATTACAGG 


TATTCCGCGT 


180 


ACATTACACA 


TTGAAAACTC 


GACAAATAGA 


ACTAATAATG 


CTAGAGAACG 


TAACATTGAA 


240 


CTTGTTGGTA 


ATTTATTACC 


AGGGGATTAC 


TTTGGTACGA 


TACGTTTTGG 


ACGTAAAGAA 


300 


CAATTATTTG 


AAATTCGTGT 


TANNCCACAT 


NCACCACAAT 


TACAACGACA 


GCTGAGCAAT 


360 


TANGAGGTCA GGAATTACAA 


AAGTGCCTGT 


TAATATTTCG 


GGAAT AC CGT 


TGGATCCATC 


420 


GGCATTGGTT 


TATTTANTTG 


CACCAACTAA 


TCAACTACGA 


ATGGTGGTAG 


TGAGGCAGAT 


480 


CAAATACCAT 


CTGGTTATAC 


CAT 








503 



(2) INFORMATION FOR SEQ ID NO: 156: 

75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1118 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 



30 



35 



40 



45 



50 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 156: 






GGCACGAGAC 


TCAAANCACT 


GAAGCATTAA 


CAAAATAATA 


CTATATTACT 


GTCTAATCAT 


60 


AGACATGTTG 


TATTTAACTA 


ACAGTTCATT 


AAAG TAGAAT 


TTATTTCACT 


TTCAATGAAC 


120 


TGTTTTTTAT 


TTACGTTTGA 


CTAATTTACA 


ACCTTTTCAA 


TAGTAGTTTT 


CATGCCACGA 


180 


GCTATCCTAA 


CCCACAGATT 


AGTGATTTCT 


ATACAATTCC 


TCTTTTGTCT 


TTACATTTTC 


240 


TTAAAATATT 


TGCGATGTTG 


AGTATAAATT 


TTTGTTTTCT 


TCCTACCTTT 


TTCGTTATGA 


300 


TTAAAGTTAT 


AAATATTATT 


ATGTACAACG 


ATTCAATCGC 


TCTATTTTTC 


AACTTTCAAC 


360 


ATATTATTAA 


TTCGGAAAGG 


ACCACTTTAA 


AATTTAACNG 


GCCACAACAA 


ATCAAATCAA 


420 


TTAATCACTT 


TTTCCAAAAT 


AATCATATAA 


GGAGGTTCTT 


TTCATTATGA 


ATATCATTGA 


480 


GCAAAAATTT 


TATGACAGTA 


AAGCTTTTTT 


CAATACACAA 


CAAACTAAAG 


TTATTAGTTT 


540 


TAGAAAAGAT 


CAATTAAAGA 


AGTTAAGCAA 


AGCTATTAAA 


TCATACGAGA 


GCGATATTTT 


600 


AGAAGCACTA 


TATACAGATT 


TAGGAAAAAA 


TAAAGGCACG 


AAGCTTATGC 


TACTGAAATT 


660 


GGCATAACTT 


TGAAAAGTAT 


CAAAAATGCC 


CGNAAGGAAC 


TTAAAAACTG 


GACTAAAACA 


720 


AAAAATG TAG 


AC AC AC C TTT 


ATATTTATTT 


CCAACAAAAA 


GCTATATCAA 


AAAAGAACCT 


780 


TATGGAACAG 


TTTTGATCAT 


TGCACCATTT 


AACTATCCTT 


TTCAACTAGT 


ATTCGAACCT 


840 


TTAATCGGTG 


CTATTGCAGC 


AGGTAATACA 


GCAATTATTA 


AACCATCTGA 


GTTGACACCA 


900 


AATGTTGCAC 


GAGTGATTAA 


ACGATTAATC 


AATGAAACAT 


TTGATGCAAA 


TTACATTGAA 


960 


GTTATTGAGG 


GAGGAATTGA 


AGAAACGCAA 


ACGTTAATTC 


ACTTACCTTT 


TGACTATGTC 


1020 


TCTTACAGGA 


GTGAAATTGT 


AGGCAAATCG 


TTTATCAAGC 


TGCAGCGAAA 


TTTAGTCCTG 


1080 


TGACATAGAA 


TGGTGGGGAA 


ATCTCC AG TC 


ATCGNGGG 






1118 



(2) INFORMATION FOR SEQ ID NO: 157: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



15 



20 



25 



CCANTTGCCG 


TTTCCTCCTA 


AACACCAGCG 


GNACGAGCTC 


ATGATGGCAT 


ACATTGTAAA 


60 


TCCGATAATT 


GACAGTCCAG 


TTGCTAATCC 


ATCTAAACCA 


TCTGTTAAAT 


TTACCGCATT 


120 


AGAAAAACCT 


AC TTGCC AAA 


AAACAATGAA 


AATAACATAT 


GCAAATGATA 


GTGGGATTGC 


180 


TACATTCGTA 


AATGGAATAT 


GTATGCTCGT 


AGAAAAATTC 


ACCAAATGAA 


ACACATTACT 


240 


TAAAACAAAG 


AAAATAATCG 


CAATACCAAT 


TTGCGCCAAA 


AAC TTCTGTT 


TACTTGTTAA 


300 


ACCTTGGTTA 


TTCTTTTTAA 


CAACAATAAT 


ATAATCATCT 


ATAAAACCAA 


TT AAC C C AAA 


360 


ACCAATCGGT 


CACAAATAAT 


AACAGGTATG 


ATTGGATTAG 


CTTGATCTTA 


CAAATATAAT 


420 


AGCCACCAAA 


GACGGTTATC 


ACAAATACTT 


TAATAGAAAT 


GGTTAGGCCA 


CCCATCGTTG 


480 


GTGTACCAGT 


CTTCTTCATA 


TGGCTTTGTG 


GACCTTCTTC 


TCGAATACTT 


TGACCAAATT 


540 


TCATCCTTTT 


TAATGTAGGT 


ATTAAAACAG 


GTACCAAAAC 


AAATGTAATC 


AC TAGCGCT A 


600 


ATAACGCATA 


TACAAAAATC 


ATAACTATCT 


CCTCTTCTTA 


ATCCAGACTT 


TTTTAACCAC 


660 


TAATATATTA 


TCAAG 










675 



(2) INFORMATION FOR SEQ ID NO:158: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 6 base pairs 

35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



45 





CTTAGATACA 


ATTACTCAAT 


ATGATGTACT 


GGAAGCANTT 


ATAGATACTA 


AAAAACACAT 


60 




TGNCTGTNCG 


ATGANTNTCA 


TCTTCACATG 


ANNTANCGGG 


TTGATTAACN 


AAGATTGCAG 


120 




ACCGTGNTGT 


TGTGATGANA 


AATGGTCANC 


TGATAGAGCA 


TGGTACACGT 


GAATCAGTCT 


180 


SO 


TGCATCATCC 


AGAACATGTT 


TATACGAAGT 


ATTTATTATC 


ANCGNAGAAG 


AAGANTAATG 


240 




ATCATTTTAA 


ACATGTGATG 


AGGGGTGATG 


TACATGANTA 


AAGTTACAGA 


TGTTGAAAAA 


300 




TCATATCAAA 


GCNCACATGT 


TTTTAAGCGT 


CGTCGAACAC 


CTATCGTGAA 


AGGTGTGTCA 


360 




TTTGAGTGTC 


CAATCGGTGC 


GACGATTGCG 


ATTATCGGAG 


AAAG TGG TAG 


CGGTAAATCG 


420 


55 


ACGTTGAGTC 


GTATGATATT 


AGGTATTGAG 


AAACCGGATA 


AAGGCTGTGT 


AACCTTAAAT 


480 



134 



BP 0 841 394 A2 



GATCTACCGA TGCATAAGAA GAAAGTCAGA CG TCATC AAA TTGGTGCTGT ATTTCAAGAT 
TATACGTCAT CATTACACCC ATTTCAGACT GTTAGAGAAA TCTTATTTGA AGTGATGTGT 
CAATGTGATG GACAACCTAA AGAAGTTATG GAAGTCCAAG CAATTACATT GTTGGAAGAA 
GTCGGTCTAT CTAAGGCATA CATGGATAAA TATCCTAATA TGTTATCAGG TGGAGAAGCG 
CAGCGTGTTG CGATTGCGCT CGTGCC 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

AATGTCAGAC AGATACTGCC ACAAGATGCG TGCNTATGAG ATTTCGCTGT GTATGAATAG 
CGACACGAGC GGCACGAGCG CTCACNTCAT TTCCAATTAA AACTAATGCC TAAATCTGAT 
GCAGTAAAAT CTATCATGAT TC AC TCTTAA CATCCGTATT TCGTGCTACT AATTGATGTC 
TTGCATTGAA AAATTGACCA TAGCTTAAAT ATGTCGCAAT CAAAGCAGAC ATAATGNTCG 
CAGTTGTATG AATAAACACG ACTAACAATT GAAATTTAAT CGCTTG T AAA GGTGGTACGC 
CACCAATAAT TAAGCCTGTC ATCATACCAG GAATCGACAC AAGCCCATAT GTTTTAACCG 
AATCAATTGT TGGCACCTAT AGC TAAACG A ATACTTTCAC GTATTGCACC TTTAGAAGCC 
AATTTAGGTG TAGCTGCAAG TGATAATTTA GATTCAATAT TAGTACCATC TTGTACGAAT 
GCACGATCTA AATTCTGGTA AGCTAAATTA ATTGCAATCA AGCCATTATT TNCAAGCATN 
CCGCCGATAG GTATAACTTC ATTGGCTGTA AAATGAATTG CCCCTGTAGC TACAGTACCT 
GCAAGTGGTA ATGCTGTTCC AATGAAGATN GCTGGAAATG GTNTCCAAAA CACANGGGGC 
ATCACTGTGA TGCTCGACTA ATGGTAAGAG TC 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 857 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
AATATAGCGT TTTNACCCCT TTGTGTNACC CTCCCCAAGA GATATAACAT NCCGCCGTNT 
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AAAATCAATT 


AGAAATATCT 


TTTTATTCTG 


ATAATAGACA 


CAGTATAGAC 


ACATTTTAAT. 


120 




GGGTGATACC 


ACTTGTAATA 


TCACGGGGTT 


GTNATGTNTT 


GNATATCANT 


NAAATACTTA 


180 


5 


TATANAAATA 


TTGCTCGGAA 


TATAAAAAGN 


TAAATAGGNT 


TTTGAGTTTT 


AAATATGAAA 


240 




TACAAAGCGC 


CCANTCGAAC 


AAAGTATTTA 


TATTAAAATA 


TGGAAAATCC 


ATC ANT ATT A 


300 




AATTAAAATN 


GTTTTATTAT 


GATAAAGTGA 


AAGTAGGTAA 


GTCTATGGAA 


GGTCTTAATC 


360 




ATCGAAGAAA 


TACAGAAAAA 


GAAGAGACAA 


CACAAACGCA 


ATC AG TTGC A 


CCTAATACAG 


420 


10 


GTGAAGAGGG 


GATGTCATCA 


GGC AAGTAAC 


ACAATCAANT 


AAGACGTCCG 


ACATACATAA 


480 




ATGAATCTAT 


CAATAAACAA 


ATGGAAGCCA 


AAGCGCATGA 


AACAGCGCAA 


AATGCAGATT 


540 




TAAAAACCGA 


AGCAAGAAGT 


TTATTTGATA 


ATGCAACCAA 


ATCAATCGGG 


AGACTAGCCG 


600 




GCAATGATGA 


AAGCATAAAT 


CTTAATTTAA 


AAGATATGTT 


TTCTGAAGTA 


TTTAAGCCGC 


660 


15 


ATACTAAAAA 


CGAAGCAGAT 


GAAATATTTA 


TAGCGGGTAC 


TGCTAAAACT 


ACGCCAGCAA 


720 




TTTGTGACAT 


ATCAGAAGAA 


TGGGGGAAGC 


CATGGCTCTT 


TTCTCGAGTA 


TTCATCGCTT 


780 




TCACAGTAAC 


ATTTATTGGA 


TTATGGGTCA 


TGGCAGCGAT 


TTTTAATAAC 


AATGACGCTT 


840 




GTACCGGTGC 


TCGTGCC 










857 



20 

(2) INFORMATION FOR SEQ ID NO: 161: 



(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

35 





CTCACTTTGT 


TTTCCAGTAT 


GGAACGCTTG 


CCCTAAGTCC 


TTAATTGCAT 


TATAAAATTC 


60 




AGGCGCT AAA 


ATAATCGCAA 


TTGCCGCAGT 


TTTAAAATCA 


ATATTATGAA 


AT ACT AC T AA 


120 




GCTTAGCGTT 


GCTTCCAATG 


CAACCAATCC 


AATACCTAAC 


ATACTTATAA 


ATTCGAGCAT 


180 


40 


TAATCCCGAT 


AAAAAAGCAC 


TGCGTAAAAT 


GCGCATTGTT 


AAAGTTCTAA 


ACTGAGTACT 


240 




ATCGTCGTAA 


ATATGCTTCT 


CTGTTTGCTC 


TGTACGATTA 


AATAGCTTTA 


ACGTCACTAA 


300 




ACCTNTAGCA 


ATATTTAAAA 


ACCGNCGACT 


AAATTGATTC 


ANATAAGTCA 


TTTGATCTTT 


360 




TTGACGCATC 


GAGCGTTTTC 


AAACCGAAAA 


TAATATAAAA 


CAAAGGAATA 


AATGGTGCAG 


420 


45 


NTATTAACAT 


AATTAATGCG 


GNATTGAAAT 


GGATGAAAAA 


CAATGCAATG 


ATTATGATGA 


480 




GCNGAACCAT 


CAATCGATTT 


GAACAACTTG 


AGGCAAATAA 


CTCTTATAAA 


AAGGTGCTAA 


540 




ACCATCAATG 


TTTTCTGTGA 


GTATAGTCAT 


TTGTTCACCG 


ATTGGATGAC 


CATTATTTTT 


600 




ATAAATAACC 


CGCTGTCTAA 


GCATATGCTT 


AACTTTAAAT 


GCTAATGTAT 


CACCTAACCA 


660 


50 


TTGATTTAGA AATTGCACAG 


NTGCTCTTAA 


AAGTAAAACA 


CCTAATAAAA 


TAAATAATAC 


720 




AATCCATAAA 


CCTTGAAATT 


GATGTCTTAT 


AATTTTAGCT 


AAAAAATCTG 


CTATTAAAAT 


780 




ATTGTGCGTT 


ATAACGAGTA 


TGCCCGAGAC 


CAGTACTGAC 


CAAGAACATG 


AGTACCGGAA 


840 




AAATTTTATA 


TTGAAACAGT 


ATTGTTGTTA 


ATTTTTCACA 


ATTATATCAC 


CTAACCTATA 


900 


55 


TAAAGTT 












907 
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. (2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:: 


L62: 








AATTTTAAAG 


ACCCCNCGCA 


TAAATANCCA 


TCCCACCTAC 


TTATCCAAAA 


GTTANGTGGA 


60 




TGGTTTTTCA 


ATTAAAATTA 


ATATTAGTGT 


AANCCAATCA 


AAGATTTAAT 


CNAATATGCC 


120 


20 


CCTGCTCAAA 


ACATTTCCTC 


ATTTAATTTG 


CTTT AC TTTC 


AATTTAATAT 


CATTATCCAC 


180 


AACACTTGGC 


GTGTCATCGT 


TATTATTTCG 


CATCTTTGAC 


ACGTTTATCA 


TCATTAGGAN 


240 




TCGGCACCGA 


ATAAAATTGC 


GATAAATGCC 


ATGATTC CCA 


TTAATACGTT 


AACCCAAAGT- 


300 




GCAATCATCG 


CACCTGTATG 


AATGCTCGTT 


GCAGCAACTG 


CACCAACATA 


TACAGCACCA 


360 


25 


CTAATTGCGA 


CACCGAATGC 


GCCACCAAGT 


GATGAAGCCA 


TTTTATAAAT 


AC CTGAAGC A 


420 


ACGCCAACTT 


TATC TAACGG 


TGCATTCGAA 


ATAGCTGTAT 


CTGTAGAAGG 


TGTTGCATAA 


480 




ATACCTAAGC 


CTAGTCCGAA 


ACATAAATAT 


CCTACGACAC 


AACTGATAAC 


ATAAAATATG 


540 




CCTGGTAAGA 


ATACTANTGA 


AATAAGTGCA 


ATNCCAATGA 


CCACAATGNA 


TGTACC TNAT 


600 


30 


AACATTGGTC 


GCTTAGAACC 


CANTTNTGGT 


NATAATAATT 


TTTCACCAAC 


TCGAATCATC 


660 




AATAACAACA 


TGATTAAATA 


AGTAANTGAT 


NAGTATCCTG 


CCTGCCATNC 


TGTATAACCT 


720 




AAACCTTGTT 


GCACGCATGT 


ATTCGCTACA 


ATTNATGTAC 


CTACAACNCC 


GTTG 


774 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



SO 


CTCTGATTCA 


ACAAAATGAT 


TATTCTTTAC 


GTAAACTACC 


TTTTTTTATT 


TGAGATGAAG 


60 




CATATGCTTT 


TAATAATATT 


GTCCCAATAA 


TACCAACTGA 


AATAATATTT 


AATACTGCAG 


120 




AGATAACACC 


TTGTGTATAA 


ACCTTGTTAG 


CCGGTTCGTT 


ATAAATCAAA 


ATATCTAATG 


180 




TTGGTGCAAT 


AAGTGCCCAG 


CAAATAATAT 


TCGCAATAAT 


TTGACCGATA 


TTAAAATAAA 


240 


55 


TCATCGATTT 


CCTAGAAAAT 


AGGCATGAAG 


AAAGATTTAA 


TTTT AGGGC C 


AATCCATCCA 


300 




TATTAAACAG 


GCGATAATTC 


CTGAACAAAT 


AACCCAACTC 


CACCAAGCAC 


TACCCGTATG 


360 
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TCGGGGAAAT 


CTTTAATAGC 


GTGNCCAACT 


AATCCAGGCA 


TTAAACCAGC 


AAAAGGCCCA 


420 




AATATTGCAG 


ATATTAATGC 


TAAAAATGCA 


TAAGATGTTT 


CTATATTCGT 


ATTAGGAAAA 


480 


5 


CCTGTTGGTA TTACAACAAA ACGCCCTAAA ATCACAAATA 


CCGCNGCTCC 


TATACCAATC 


540 




GCAACAACAG 


TTTTAACTGA 


AATATCNTGT 


TTTTTCATCT 


TCATTACTCC 


TTACATAAAA 


600 




AATTCATTAA 


ATTGATGGTG 


CTTTAGATAA 


ATGAATCGTC 


CAATCATTTC 


CAGTACCAAT 


660 




ATGATATAAA 


TCTGAAAATG 


AGTCTCGATT 


GACTGCTACA 


CCAATATTTA 


CTAGCGAGTT 


720 


10 


AACATACACA 


AGAGGTTCAC 


CCACATTAAC 


ATCTGCAAAC 


GATCGCTCGT 


GCC 


773 



(2) INFORMATION FOR SEQ ID NO: 164: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 





CATAAGACAT 


GTAAATTCTA 


AGATGATTTG 


TTGGATAGGG 


TAGCTCCATA 


TAATTC TTGA 


60 




ATCCNATCAT 


TATTACATTA 


ATAAAAAAAA 


CACCCACAAT 


TGTGGGTGAT 


TGTAATGAAC 


120 


30 


GTATTATCTT 


GGCACGAGTA 


CTCAACGATT 


AATTGTTCGT 


TAATTTCAGC 


AGGTAATTCG 


180 




CTACGTTCTG 


GTAAACGTAC 


GAAAGTACCA 


GTTAAGCTGT 


CAGCATCAAA 


GTTTAAGTAC 


240 




TCAGGTACGA 


AATTGTTGAT 


TTCAACTGAT 


TCAACGATGA 


TGTTTAGTTT 


TTGAGATTTT 


300 




TCACGAACTG 


AAATTGTTTG 


ACCAGGTTTA 


ACAGAATAAG 


ATGGAATATC 


AACACGTTTA 


360 


35 


CCATCTACTA 


AGATATGACC 


GTGGTTAACT 


AATTGACGTG 


CTTGACGACG 


AGTACGAGCT 


420 




AAACCTAATG 


AATAAACAAC 


AGCGTCTAAA 


CGACTTGCTA 


ATAAAATCAT 


GAAGTTTTCA 


480 




CCCGCGTACA 


CCCNAATTTT 


TTACCAGCGA 


TGTCAAATGT 


GTTACGGAAT 


TGTCTTTCAG 


540 




TC AATC CAT A 


TTANGTAACG 


TAATTTTTGT 


TTTTCACGTA 


ATTGTAAACC 


ATATTCTGAT 


600 


40 


AATTTTTTAC 


GTTGGTTTGG 


ACCATGTTGT 


CCTGGTGCGT 


AAGGACGTTT 


TTCTAATTCT 


660 




TTACCAGTCT 


CGTGCC 










676 



(2) INFORMATION FOR SEQ ID NO: 165: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
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15 



25 



35 



GGC ACGAGCG ATGATGGCTG TCGGAACAGG TGCATTTGGT GCGCATGGTT TACAAGGGAA 60 

AAATAAGTGA TCACTATTTA TCAGTATGGG AAAAAGCAAC GACGTATCAA ATGTACCATG 120 

GCTTAGCATT ATTAATTATA GGTGTAATTA GTGGTACAAC TTCAATCAAT GTTAACTGGG 180 

CTGGCTGGTT AATATTTGCT GGTATTATTT TCTTTAGTGG ATCATTATAT ATTTTAGTAT 240 

TAACTCAAAT TAAAGTTTTA GGTGCGATTA CGCCAATTGG TGGCGTATTG TTCATCATTG 300 

GCTGGATAAT GTTAATCATT GCGACATTCA AATTTGCTGG TTAAATTTTA AAACTTTAGA 3 60 

TTACCTATGT AACTAAACAT TAAATTTTTA ANAAAAA 397 

(2) INFORMATION FOR SEQ ID NO: 16 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



TATCTTTAAG 


AACAATTGAA 


GCCGCTAACC 


ATCGGAGAAA 


GCTGTAAAAT 


CCAAGTGTTG 


60 


GCCTGCTAAA 


TACCCACAAT 


CATCACGGGT 


CGTTGCCTTG 


TACCACAATA 


GAAGGCAGCA 


120 


TTATCACCCA 


AATATTTGCA 


TAGCTAATTG 


TGATAAAACT 


GTCGTTTCCG 


TTTGTGGCAT 


180 


AATTCCATAA 


ACATATGCTA 


AACCACCGAT 


ACCAACTAAT 


AAAAACGCTA 


AAATTGAACC 


240 


CATAGCAATT 


AACGTTTTTA 


CAGCATTATT 


AGCACTTGGT 


TCTCTAAAAA 


TTGGTGACCG 


300 


CATTTGAAAT 


AGCTTCAACA 


CCTGTTAATG 


ATGAAGCCCC 


TGATGAAAAA 


GCTCTTAATA 


360 


GC AAGAATAA 


TGTTACTCCA 


GGANCCGCAG 


TTCCTACTGA 


TGCATGCATA 


TGTGGTTGAA 


420 


TATCTCCTGT 


CGCCACACGG 


AAAGTACCCT 


ATAAATATTA 


ATATCACTAA 


CCCCATAATG 


480 


ANAAGATATA 


CTGGATAGGA 


TAATACGGTG 


NCAGATTCAG 


TTAAACCCAC 


GTAAATTTAA 


540 


TATTAAAATA 


AAAAGTACAA 


GTAAACATGC 


AATCAGTACT 


TTATGCCCAT 


ATAAAC TTGG 


600 


GAATGCAGCA 


ACANATGCAT 


CAGCACCAGA 


TGATATACTA 


ACAGCGACAG 


TCAGTATGTA 


660 


ATCGACTAAT 


AATGAGCCTC 


CTGCAAGCAA 


TCCCCATTTT 


TCTCCTAAAT 


TGGTCTTGGA 


720 


CACCATATAC 


GCTCGTGCC 










739 



(2) INFORMATION FOR SEQ ID NO: 167: 

45 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 507 base pairs 
BQ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5S (ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167; 



5 



10 



GGCACGAGCG 


AGAACGATTG 


AAGCTACAAT 


AC C TGATGTT 


GCTGCGGGAA 


GTACGACTTT 


60 


AGTTGCTACT 


TCTAATTTAG 


TTGCTCCAAG 


TCCATAGGCA 


CCTTCTCGAA 


TTTTATTTGG 


120 


TACAGATGCC 


ATTGCATCCT 


CACTCAAACT 


TGTGATGAGA 


GGGACAATCA 


TAATACCGAC 


180 


AACTAAGCCG 


GGAC TT AT AG 


CATTAAACTC 


TCCAAGACCT 


GATATGAAAG 


ATCTTAATAC 


240 


TGGTGTAACA 


AAGGTTAATG 


CAAAGAAACC 


AAACACAATT 


GTTGGTATTC 


CTGCTAAAAT 


300 


TTCTAATATC 


GGTTTAATTA 


TGCGTCGGGC 


ACGGGCACTT 


GCATATTCAC 


TTAAATAAAT 


360 


TGCTGCACCA 


AGCCCGACTG 


GAACTGCAAA 


TATAGTCGCA 


ATAACTGTGA 


TTTTTAAAGT 


420 


CCCTATTATC 


AATGCCAGAT 


ACCAAACTTA 


GGGTC TGAAC 


CGGTAGGATT 


CCAGTAGTAG 


480 


AAATAGAAAT 


CAGTATTGGA 


ATTCTGG 








507 



15 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE : Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



35 



40 



45 



G AATAGTC T A 


CTCATTCATC 


GTCCTTCACC 


ATTGATGGAT 


CCAGAACAAG 


TTGCTGATGC 


60 


ATTAACTAAA 


CTTGTTAAAC 


AAGGTAAGTT 


GAAGTCATTC 


GGGGTGTCGA 


ATTTTAATCA 


120 


TTCACAATAC 


CAATTGTTAA 


ATCAATATAT 


TATGAAAGAA 


AGACTACATA 


TTAGCATCAA 


180 


TC AA TT AG AA 


TTATCGCCAT 


ATCACGTTGA 


TAGTTTACAA 


GATGGAACAA 


TGGATTCAAT 


240 


GTATCAAAAC 


CATGTTCAAA 


TTATGGCTTG 


GAGTCCTTTT 


GCAGGCGGTA 


AAATTTTCGA 


300 


CAAGGAAGAT 


ATTAAAGCGC 


AACGTATTAT 


GAAAGTTGTC 


AATCAATAGC 


TGACAAATAT 


360 


GGTGTGAGTG 


ACACAGCTGT 


GATGATAGCA 


TGGTTAGTAA 


AAATACCGCA 


TAGTACCATG 


420 


CCGATACTGG 


GAACAAGTCA 


GTTAAAGCGT 


ATNGATCAAG 


CAATCGAAGG 


GCTACAACTT 


480 


AATTTAGATG 


ATCAAGTCGT 


GGGTTGACAT 


TTACAACGCT 


ATTATCGGAC 


AAGATATCCC 


540 


GTAAACTTAN 


NNACNCNNAA 


ATCATAAATG 


GAGCATACCA 


TGACAAACGA 


AGATAAACGT 


600 


TTCGAACAAT 


TAAGATTTGA 


ACGCAAATTT 


AT AG TTATCC 


CGTATTTAAT 


TTATGCAGTC 


660 


ATTGTATTAC 


TATTAAATAT 


TTTCTATTCT 


GATT TGAAAA 


TAACAATGAC 


ATTATTCGGA 


720 




CGTATAATGT 


AGTCATTTTG 


TTC 






753 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 
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-(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



10 


CCCATTATTA 


TTTNGAAAAT 


GAATTGGAAA 


ATACAGTAGA 


CGAAGTGTGG 


GTTGTATACA 


60 




CTTCTGAAAG 


TATACAAATG 


GATCGTTTAA 


TGCAACGTAA 


TAATCTGNCA 


TTAGAAGATG 


120 




CGAAAGCACG 


TGTCTACATA 


CATCAACGCG 


TACGTAAAGT 


AATGACTTAT 


TTATAGTGTA 


180 




ATATTAATCT 


TCTTCTCCGN 


AATTCGGNTT 


TNCAATATAA 


CCTTCTTCTT 


CTAACAANCT 


240 


15 


CTCAAGGTTG 


TGNGTNAATN 


CAAGTTTATC 


CCCTAAATTA 


TCGATAACAT 


GATCGGCCAN 


300 




TCGGNTNNNN 


NNATCAATAG 


AAATTTGGNT 


TATAGACACG 


TGCTTTCGCA 


TCTTCTAATG 


360 




ACAGATCATN 


ACGTNGCATT 


AAACGATCC A 


TTTGTATACT 


TTCAGAAGTG 


TAT AC AAC CC 


420 




ACACTTCGTC 


TACTGTATTT 


TCCAATTCAT 


TTTCAAATAA 


TAATGGAATA 


TCCACGAACC 


480 


20 


ACATTATATC 


CTTGGTTTTA AATATTCTTG 


CTTTTCTTCT 


GCCATAATAT 


CTCGCACGAA 


540 




TA 












542 



(2) INFORMATION FOR SEQ ID NO: 170: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: Genomic DNA 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



40 



45 



50 



TCAATCCANC 


CTCCAATTGC 


TGATGAAGAA 




CNCCCACCCA 


TTANNGGGGC 


60 


CANAGTCATA 


AGTAACAACT 


TTTGAACCAA 


TTGTATCTTC 


AATTGAATCT 


GTAATCTTAT 


120 


CTCCCGCTTC 


TTCCCATCCT 


AAATGTTCTA 


ACATTAATAC 


AGAAC TT AAA 


ATTACTGAAG 


180 


NTGGATTCAC 


TTTATTTAAA 


CCTGCATATT 


TTGGANCTGA 


GACCATGTGT 


TGCTTCAAAA 


240 


ATAGCATGAC 


CTGTTTCATA 


ATTAATGTTT 


GCACCTGGCG 


CAATACCAAT 


NCCACCAACT 


300 


TGTGCAGCTA 


AAGCATCTGA 


AATATAGTCA 


CCATTCAAGT 


TCATAGTTGC 


TACAACATCA 


360 


TGCTCAGCTG 


GACGAGTTAA 


AATTTGTTGT 


AAGAAAATGT 


CAGCAATAGA 


ATCTTTAATG 


420 


ATAATCTTGC 


CTTCTTTCAC 


AGCTTTTTCT 


TGAGCAGCAT 


TAGCAGCATC 


TCTGCCTTCA 


480 


TTTACAACAA 


TTTCGACATA 


TTGTTGCCAA 


GTGAATACTT 


GCATCACCAA 


ATTCAGATAA 


540 


TGCTAAATCG 


TAACCCCACT 


GCTTAAATGA 


GCCTTCTGTA 


AATTTCATAA 


TATTACCTTT 


600 


ATGAACTTAA 


AGTAACTTGA 


TTTACGGGTT 


ATTTATCGAT 


AGCATATTGG 


TATAGCTGCT 


660 


CTACTAATCG 


CTCAAGTCCT 


TCTTTTAGAA 


CTTGGTTTAA 


TACCAATACC 


TGAAGGTTCT 


720 


TGGAATCGAT 


T 










731 
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(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 171: 








TTTGACATCT 


CTCAAAAATT 


AAAGCATAAA 


GGTTTATCGG 


AAAGGCGCAA 


AATTCACAGT 


60 




TGTTTGTTGG 


TTTGTTATTT 


CCCTCNCAAT 


ATTCAGTATT 


AGACATTTAT 


AGTTTGGAAA 


120 


20 


ATGCGTGATA 


ATTAGTTGTA 


TTCAGTTATT 


AAGTAATAAA 


TTTTTGGAGG 


CAGAACATCA 


180 




TGAAATTAAC 


ATTAATGAAA 


TTTTTTGTGG 


GGGGATTTGC 


AGTATTATTA 


AGTTATATTG 


240 




TATCTGATAA 


CAATAACCTT 


GGGAAAGAAT 


TTGGCGGTAT 


ATTTGCAACG 


TTTCCGGGCA 


300 




GTATTTTTAG 


TGTCTATGTT 


TATTACAGGT 


ATGCAATATG 


GTGATAAAGT 


CGCTGTGCAT 


360 


25 


GTAAGTCGTG 


GCGCAGTGTT 


TGGTATGACA 


GGGGTATTAG 


TTGTATTTTA 


GTAACATGGA 


420 




TGATGTTACA 


TATGACGCAC 


ATGTGGTTGA 


TTAGCATTAT 


TGTTGGTTTC 


CTAAGCTGGT 


480 




TCATCAGTGC 


AGTATGTATT 


TTTGAAGCGG 


TAGAATTTAT 


AGCACAAAAA 


AGATTAGAAA 


540 




AGCATAGTTG 


GAAAGCTGGA 


AAATCGAATA 


GTAAATAGTG 


TGAACGTAAT 


CTCTTAACTA 


600 


30 


GGACTAACTT 


TGCAAGCATT 


GAATAGCATG 


GAAAAGTTGC 


ATCATTAATA 


AGTGAAATTC 


660 




AAGTTGGCAT 


TGAGAAAATT 


ACAAGCGCTC 


GTGCC 






695 



(2) INFORMATION FOR SEQ ID NO: 172: 

35 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 612 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

45 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 





GGCACGAGCT 


CGGCTTGACG 


GTAGGAAATA 


TCAGCACGAG 


CTTGATTCAA 


CANCCGAATC 


60 


50 


AGGGAGGAAA 


TGCAGGTATA 


ACTCAATCTA 


AGTTCGCTAA 


GAGACATCCA 


ACGTTTTACN 


120 




AGGCAAATCC 


AAGCNAAAAG 


GATTGCTTCN 


AATAATATCC 


CACCAAACAA 


TTTAAGACAT 


180 




TATGCTGTTA 


AGAGGTCACC 


AACNATATAT 


ATAGTGGTTA 


CGGATCAGTT 


ATTAGCGTTC 


240 




TTTAACAACA 


GATATTGGCG 


CTCACAGTTT 


AACCCAAGAG 


GTGGTTGGTC 


TCCAAGTGGT 


300 


55 


C C AAGAAGAT 


ATGCGAATGG 


TGGTTTGATT 


ACAAAGCATC 


AACTTNC TG A 


AGTGGGTGAA 


360 




GGAGATAAAC 


AGGAGATGGT 


TATCCCTTTA 


ACTAGACGTA 


AACGAGCAAT 


TCAATTAACT 


420 
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35 
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45 



GAACAGGTTA TGCGCATCAT CGGTATGGAT GGCAANCCAA ATAACATCAC TGTAAATAAT 480 

GATACTTCTA C AGTTGAAAA ATTGTTTGAA ACAAATTGTT ATGTTAAGTG ATAAAGGAAA 540 

TAAATTAACC GATGCGTTGA TCAAAC TGTT CTTCTCAGGA TAATACTTAG TTCTATGATG 600 

CACTTAGAAG TT 612 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



GGCACGAGCG 


ACTTTTTCTA 


GGCATAATCG 


AATTGACAAT 


GGTACTCAAG 


CTTAAAAATG 


60 


GCCCACTTAA 


TTCAGGCAAT 


AACAGACTAG 


GCATAACATT 


ATTTTTCATC 


AATTTAAATG 


120 


TGTAAAACAT 


CGATGACATT 


GTCTGTTGCT 


GTTGTCGATA 


AACATTCATA 


TCGTAGCGGT 


180 


CTGCAAATTC 


TTTAATGCGA 


TATGCCGGCG 


GCACGAGACA 


TGACAGGTAA 


TGAATCATGT 


240 


TTGAATTGTT 


CGTCTACGGC 


ATCTTTTTGA 


ATAGGTAATC 


CAAAGAAACC 


TGCAATACCA 


300 


ATCGTTTCAA 


AGGGCCCTGC 


TGCTTCGATA 


TGTCTACGAA 


ATGGTTC TGA 


ACGAACATCT 


360 


ATACAAAATG 


CAATTTGCGC 


TTTCGTTGAT 


GTGCCCACCT 


GATTTAGCTC 


GCTATTATTT 


420 


TCATCAACTG 


CTTGTGTGTC 


AATTAACAAT 


AC TGAATGTG 


GCTGATTAGC 


GTTATCATTT 


480 


TCTGAGACAT 


TTACTTGGTT 


TACATCTAAT 


GCGCCCGCCA 


CACTTTCATG 


ACTGCTTTAA 


540 


TTTTTGTTTT 


AACTGAGATC 


GTATGTCAAT 


TTCCCAGGCA 


ATTAGCCATA 


AATTTNTAAA 


600 


TACAT 












605 



55 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

GGCACGAGCT GATCAAGGGT GTAAGTTGGT AGTGGTCAAT AAAGAACAAT CATTACCAGC 60 
TAACGTAACA CAAGTGGTTG TGCCGGACAC ATTAAGGAGT AGCTAGTATT TCTAGCAACA 120 
ACAACATTAT ATGGATTATC CGGAGTCATC AGTTAGTGAC ATTTGGTGTA ACGGGTACAA 180 
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ATGGTAAAAC TTCTATTGCG ACGGATGATT CATTTAATTC AANGAAAGTT ACAAAAAAAT 240 

AGTGCATATT TAGGAACTAA TGGTTTCCAA ATTAATGAAA CAAAGACAAA AGGTGCAAAT 300 

ACGACACCAG AAAACAGTTT CTTTAACTAA GAAAATTAAA GAAGCAGTTG ATGCAGGCGC 360 

TGAATCTATG ACATTAGAAG TATCAAGCCA TGGCTTAGTA TTAGGACGAC TGCGAGGCGT 420 

TGAATTTGAC GTTGCAATAT TTTCAAATTT AACACAAGAC CATTTAGATT TTCATGGCAC 480 

AATGGAAGCA TACGGAC AC G CTCGTGCC 508 

(2} INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS : 
T5 (A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 5: 



GGCNCGAGAC 


TGTCGAATAT 


TTAGTAGTAA 


CTTCAGATTA 


CAAGC GTATG 


ACTTATCGAC 


60 


CGAACGGTAC 


AAATAAAGTA 


TTTGTTAAAA 


GAAAAGAAGC 


GGGNTCATGG 


TCTGAGTGGT 


120 


CAGAATTAGC 


TATTAATGAT 


TACAATACAC 


CTTGTGAAAC 


TGNTCAAAGT 


GCCCANTCAA 


180 


AAGCTAATAT 


GGCCGAAAGT 


AACGCTAAAT 


TATACGCAGA 


TGACAAGTTT 


AATAAAAGGC 


240 


ATTCGAGTTA 


TTTGTGGATG 


GAACAGCAAA 


TGGTGTGGGC 


TCTACATTGN 


AC TT AAATGA 


300 


GAGTTTAGAC 


CAATTTATTT 


TATTAATTTT 


TTATGGGACT 


TTTCCAGGTG 


GTGACTTTAC 


360 


AGAGTTTGGC 


AGTCCTTTTG 


GAGGAGGAAA 


GATTTCATTG 


AATCCCTCAA 


ATCTTCCAGA 


420 


TGGTGATGGA 


AATGGTGGAG 


GTGTTTATGA 


GTTTGGATTA 


ACTAAATCTA 


GTCGTACATC 


480 


TTTAACTATA 


TCAAACGATG 


TCTATTTCGA 


CTTAGGAAGT 


CAAAGAGGCT 


CTGGTGCGAA 


540 


CGC AAATAGA 


GGGACAATTA 


ACAAAATTAT 


AGGAGTGAGA 


AAATAATGCA 


AATATTAGTT 


600 


AACAAGCGTA 


ATGAGATAAT 


TTCATACGCT 


ATCATTGGTG 


GCTTTGAAGA 


AGTATGATAT 


660 


TGAAATTACA 


GAAATTCTCT 


CAAGTTTTAG 


ACTAAGCTTT 


AAATATCAAT 


GGGAATAGTT 


720 


TTACGAAGAT 


ATCCGAGAAA 


AGATGACTGC 








750 



(2) INFORMATION FOR SEQ ID NO: 176: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17 6: 
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AAATGCCGGG 


GGAGCTCAAG 


TAT ATG AC TG 


AAATAACATT 


CAAAGGTGGA 


CCAATCCACT 


60 


AAAAAGGTCA 


ACAAATTAAT 


GAAGGTGATT 


TTGCACCTGA 


TTTTACAGTG 


TTAGATAATG 


120 


AC TTAAATC A 


AGTAACATTA 


GCAGATTATG 


CTGGTAAAAA 


GAAATTAATT 


AGTGTGGTAC 


180 


CATCAATTGA 


TACAGGTGTT 


TGTGATCAGC 


AGACTCGCAA 


ATTCAACTCT 


GAAGCTTCTA 


240 


AAGAAGAGGG 


GATTGTGCTT 


ACAATTTCAG 


CAGACTTACC 


ATTCGCACAA 


AAAAGATGGT 


300 


GCGCTTCAGC 


AGGTTTAGAC 


AATGTCATTA 


CATTAAGTGA 


CCACCGTGAC 


TTATCATTTG 


360 


G TGAAAAC T A 


TGGCGTTGTT 


ATGGAACGAA 


CTTCGTGCCG 


AATTCGGCAC 


GAGCTCGTGC 


420 


AGTATTTGTA 


TTAGATGTAG 


ATAATAAAGT 


TGTTTATAAA 


GAAATCGTTA 


GTGAAGGTAC 


480 


TCGATGCCCC 


AGATTTTGAT 


GCTGCTTTAG 


CTGCATACAA 


AAATATTTAA 


TCATTAAAGA 


540 


GATAAATCTT 


AAAATGTATA 


CATCGTGTCC 


ATCGTTGTCA 


ACAGCATTAA 


AATAGAATTG 


600 


TTTTC T ATG A 


TTGCTAAGAC 


CTATGGGCAC 


TTTTTATTGG 


AG AGGGAC G A 


ATATGGCAGA 


660 


ACAACAAACA 


ATTATGGAAC 


GCTTGTTTCA 


TACATTAGAT 


GAAAAAGCTA 


AAA C ATT AAA 


720 


TAATGAAAAT 


GGCCAAAGTT 


TTATTGAAAA 


TCTTGGGCTA 


GCAATGGAAC 


AAGTATATAC 


780 


CAATGAA 












787 



20 

(2) INFORMATION FOR SEQ ID NO: 177: 



(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 568 base pairs 

{ B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

{ D J TOPOLOGY : 1 inear 

30 

(iij MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



35 





GGCACGAGCG 


CGTCATATAA 


TGAAAGTAAT 


GATAAAAAGA 


AAGGATAACT 


TAATGTGAGT 


60 




CAAGAACGTT 


ATTCAAGGCA 


AATTTTATTT 


AAACAAATAG 


GTGAAATAGG 


TCAAAGCAAA 


120 




ATAAATCAAA 


AATGTGCGTT 


GATTATTGGT 


ATGGGCGCAT 


TAGGTACACA 


TGTGGCCGAA 


180 


40 


GGACTTGTTA 


GAGCAGGCAT 


TGCTAAACTA 


ATCATTGTTG 


ATAGAGATTA 


TATTGAATTT 


240 




AGTAATTTAC 


AAAGACAAAC 


ATTGTTTACT 


GAAGAAGATG 


CTTTGAAAAT 


GATGCCTAAA 


300 




GTGGTTGCAG 


CTAAAAAGCA 


TTTGCTAGCG 


TTACGTAGTG 


ATGTTGATAT 


TGATGGTTGT 


360 




ATTGCCCATG 


TGGATTATTA 


TTTTTTGGGA 


AACACATGGA 


CAGGACGTTG 


GACGTTATTA 


420 


45 


TTGATGCAAC 


CGATAACTTT 


GAAACACGAC 


AACTGATTAA 


TGATTTTGCA 


TATAAACATC 


480 




GTATTACCTG 


GATTTATGGC 


GGGCGTTGGT 


CAGAGTACAT 


ATTCAGGAAG 


CTGCATTTAT 


540 




ACCTGGNAAA 


CACCTGCTTT 


ACTGTTGG 








568 



50 (2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 573 base pairs 
55 <B) TYPE: nucleic acid 
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-(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 



15 



20 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 178: 






CTACNNTAAT 


AAGAAAATAT 


AACATACNAA 


TCAAAAACTA 


AAGGGATGTG 


ACGTTAATGA 


60 


AACTCGTATT 


TGTGGCACGA 


GCTGGTAATA 


TGGCACAAGC 


TATATTTACA 


GG AATTATTA 


120 


ACTCAAGCAA 


CTTAGATGCC 


AATGATATAT 


ATTTAACAAA 


TAAATCTAAT 


GAACAAGCTT 


180 


TAAAAGCATT 


CGCTGAAAAA 


CTAGGTGTTA 


ACTATAGTTA 


TGATGATGCG 


ACATTATTAA 


240 


AAGATGCAGA 


TTATGTTTTT 


TTAGGTACCA 


AACCGCATGA 


CTTTGATGCT 


CTAGCAACAC 


300 


GCATCAAACC 


ACATATCACA 


AAAGACANTT 


GCTTCATGTC 


AATTATGGCA 


GGTATTCCGA 


360 


CTGATTATAT 


TAANCAACAA 


TTAGAATGCC 


AAAATCCAGN 


TGCTAGAATT 


ATGCC ANAC A 


420 


CAANTGCGCA 


NGTTGGACAC 


TCAGTTACTG 


GCATTAGTTT 


TTCAAACAAC 


TTTGAACCCT 


480 


AAATCCTAAA 


GATTAAATTA 


ACGATTTAGT 


TAAAGCATTT 


GGTCTGTATT 


GAAGTATC C A 


540 


GAGATCATTT 


TACATCCAGT 


TACAGCTATC 


ACC 






573 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 



40 


CTCTATTATA 


AACATATTAA 


AACGCATTTT 


TCATGCCTAA 


TTTATC T AAA 


TATGCATTTT 


60 




GTAATTTTTG 


AATATCACCT 


GCACCCATAA 


ATAAAACAAC 


AGCATTATCA 


AATTGTTCTA 


120 




ATACATTAAT 


AAGAATCTTC 


ATTAATGAAC 


GATGCACCTC 


CAATTTTATC 


AATTAAATCT 


180 




TGTATCGTTA 


ATGCGCCAGA 


ATTTTCTCTA 


ATTGAGCCAA 


AAATTTCACA 


TAAGAATACA 


240 


45 


CGATCTGCTT 


TACATAAACT 


TTCTGCAAAT 


TCATTTAAAA 


ATGCTTGTGT 


TCTAGAGAAA 


300 




GTGTGTGGTT 


GAAATACTGC 


AACAACTTCT 


TTATGTGGAT 


ATTTCTTTCG 


TGCTGTGTCA 


360 




ATTGTAGCAC 


TAATTTCTCT 


TGGATGGTGT 


GCATAATCAT 


CTACAATAAC 


TTGATTTGCA 


420 




ATTGTAGTTT 


CATTGAAACG 


ACGTTTAACA 


CCACCAAACG 


TTTCTAATGC 


TTCTTTAATA 


480 


50 


TTTGTAACAT 


CTAGCTTCTC 


TAAATAACTA 


ATCGCAATTA 


CAGCTAATGC 


ATTTAAAACT 


540 




GTATGGTCAC 


CATATTGTGG 


AGACAGGAAG 


TGATCATAAA 


ACTCACCATC 


CACATACACA 


600 




TCAAAAGCAG 


TCTCGTGCC 










619 



(2) INFORMATION FOR SEQ ID NO: 180: 
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(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 : 



15 


GGCACGAGCG 


TGTCAATATT 


TACATGATGA 


TTTCATTTAA 


TAGCCTTTTT 


TAGCATCAAC 


60 




TTCATTCTCA 


ATTAGACCAT 


TCTTATTGAG 


AAAATTAACT 


AGATTGTTTT 


TAAAAATATC 


120 




TAATAAGTCA 


TACTTTGCCT 


CATAATCATT 


ACCAGTTATA 


TGCGCTGTTA 


TAGTTACATT 


180 




TTCCAATTCA 


TATAATTCAT 


GATTAGGTTT 


CAAAGGTTCA 


TTTTCAAACA 


CATCTAAATA 


240 


20 


TGCATGTCGA 


ATAACTTTAC 


TTTTTAATAC 


TTCTATTAAG 


AGCGC TTCCT 


TTAACTATGC 


300 




TACCTCGTCC 


TATATTTATA 


AAAAGTGCTT 


CCATCCTTTC 


ATTAATTCCA 


AAATGTTTTT 


360 




TTCCTTTAGT 


TAAATGAATC 


CGTTTCCTTG 


CGTTTCCTGG 


TAAAGCATTT 


ATAATAATGT 


420 




CAGCATTTGG 


TAATGTGCTT 


TCA 








443 



25 

(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 181: 



45 



50 



GGCACGAGCT 


ATGTGGTTTG 


AAGTCAATCG 


CCAAATGTTC 


CACCTACCAT 


TCTGATAAAA 


60 


GTATTTACTT 


GTTCTTTATT 


CCATAACACA 


TATACTTTAT 


GATCTCTATT 


TTCAAATTGT 


120 


CTATGCACAT 


ATTTTTGTAA 


AGGATGCAAC 


TTTCCTTTTT 


CTTGC TTC AT 


TTCTACAAAA 


180 


TATGTTTTTC 


CTTCTGGCAT 


AATAATAATT 


CTATC TGGCA 


CACCTCTTGT 


TCCAGGTGCG 


240 


ACCCATTTTA 


AACATAAACC 


GTTTAGCTTT 


GTTATCTCTT 


TCACTAAATA 


TTTTTCTAAT 


300 


GTCGATTCCT 


TTCATTTATT 


CACCTTGTAT 


ACAAAATTTA 


TATTTGTGTT 


CCGAATGTTT 


360 


GTTATCAATT 


CCTTGCCAAA 


CTTTTTAAAA 


ATAGCTGTTT 


AGAGGGTTTA 


CCCCTATACC 


420 


CCCTTTACTC 


CCCTAAACAC 


TACTTTTTTA 


AACTTTATAG 


TGAATTTGAA 


TGCAACATTG 


480 


GGAAACAAAC 


AGGTTTGAAC 


CCCTACAGCT 


AGAAAGAAGA 


GTGTTTGTAA 


TCATTGTTGC 


540 


ATCATGTTGC 


ATCACCAAAA 


TGATACAAC 








569 



(2) INFORMATION FOR SEQ ID NO: 182: 
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(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



20 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 182 : 






GGCACGAGCG 


CAAAATATTT 


CAAATAAAAA 


TGATTGTAAA 


AAGGCAAAAT 


ACAAAATTTC 


60 


ACTTAACAAC 


TAGTACATAA 


AGTAATACAA 


TTAAATTAAT 


TCTATCTGAA 


AGATGTGTGG 


120 


GGCATCGTTA 


TTTTAGGTGG 


ATATGAGCAA 


TTTATTAAAA 


GTCATTTACG 


GAAAATATAT 


180 


ATAGACGGGG 


TGAGTAATAT 


GCAAGAACAT 


GTGGTGGTTA 


CACTTGATGG 


AAAAGATTAT 


240 


CTTGTAGAAC 


CAGGTACGAA 


TTTACTTGAA 


TTTATTAAAT 


CACAAGATAC 


TTTTGTCCCT 


300 


TCAATTTGTT 


ATAACGAGTC 


GATGGGCCCA 


ATTCAAACAT 


GTGATACATG 


TACTGTTGAG 


360 


ATTGACGGTA 


AAATTGAACG 


CTCATGTAGT 


ACGGTGATTG 


ATCGCCCAAT 


GACTGTAAAT 


420 


ACTGTGAACA 


ATGATGTGAA 


AGATGCTCAA 


AAAGAGCCTT 


GATCCGAATT 


TTAGAAAAGC 


480 


ATATGCTGTA 


TTGGACAGTA 


TGTGATTATT 


A 






511 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 





ATCCTTTATT 


TNTAAGGCGT 


TTCATCAAGC 


TAAC AC TTC A 


TTATCTTTAG 


TCGCTTTAAT 


60 




GCCCTTCTAT 


TAAACTCGGA 


ACTAAAAATG 


GATTGNCTTT 


TCAACCACCC 


GANCACTAAA 


120 


45 


ACATTGTCNT 


TTTTGATGGC 


CNATTAAGAC 


ACAATTTCCC 


GCTGTTTGCT 


TCAAAGTAGC 


180 




TTGCTTCTTG 


NATTNATTTT 


CAATATCTTT 


CTTGTTAAAA 


ACAAGANTGT 


TGCACAGTTT 


240 




GATTGGCATC 


TTTATTTAGA 


NCAATGGCAT 


CTGCTTGCCA 


CTTATCAATG 


CCTTCTTTAT 


300 




TCATATTGAT 


AAGACCATTC 


GCCAATCCAG 


ATAATAAAAA 


TAGCAAGTAA 


CTAATCATCG 


360 


50 


TTAACACACC 


AATAATTAGT 


CCAAACTTCA 


ATTTGTTGCG 


CCGTATTTCA 


TTCCAAGCTA 


420 




AAAACATGCA 


TTTCTCTCCC 


TACTACTATG 


ATTTAAACAT 


TGTTTATATT 


CTTAGATGCA 


480 




CGTACGTCGT 


GTTGCGCTCT 


GTAATGTTAT 


ACATACACTT 


ATCCTTCATT 


ATACCCGANC 


540 




TTTTTATATT 


AAAACCAAAT 


TTATGGAAAA 


TGCAANNANT 


TGTCTATTAT 


TTTTGTGCGG 


600 


£5 


TACATTTAAA 


ATTAAGGATC 


AATTTAAAAA 


CGCCTACATA 


TACCTTTAAG 


TACATGAAGA 


660 




CGTCCAATTC 


ATATATTATT 


TAACTTCGCC 


TGTTTTAGGA 


TCGGATTGCT 


TAATAGCATT 


720 



148 



EP 0 841 394 A2 



TTTACGTAAT TTATCTTTTG CTTTGTCACT TGCTTTATAG TTATTGTTGT AAATCGTAGC 780 

TTCCCAACTA CCATACATTG GGTTAGGGAA AATGATATAT TTCTTACCGA AATCGTCTTT 840 

ATGTTTTTCA ATTAATGCTT CACGAGATTC AGCTGT AGC T TCTTTTGGAT CTGTAAAGTC 900 

TAATAAATTA TCTCCAAATA GCATGACAAG TTTATGATCC TTTTGAACCA TTTGTCTGCG 960 

TGATTC TTTA CTCTTATCAT CTTTACCTTT TAGTAAAATA TGACTCTTCT TAGCTTGAGG 1020 

GATACCTTGT TGTTTTAAGT TCTTTTGTGT TGCCTTTAAA TCTTTTTCTT TATCTCTATC 1080 

AGAAATATAG TAGATATCGA CACCTTTTTT GTCAGCATAT TTCAA 1125 

(2) INFORMATION FOR SEQ ID NO: 184: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 





AGAAGCATAT 


AAACAAAACA 


TGTTCGCATT 


ATTAGG T AAA 


ACTGGTTTTG 


AAGACTTGAA 


60 




AAAAGAATTA 


GAAGAGCGTT 


TATAAAATAC 


ATTACTTCAA 


TGATTAGTGA 


AGTTTGAAAA 


120 


30 


GATAGAACTA 


GACGTTAACT 


ATTTAAAGCA 


TATTTTCGAG 


GTTGTCATTA 


CAAATGTAAA 


180 


AATGTAATGA 


CAACCTCGTT 


TTTATTTATA 


TGCAAGAACT 


AGGTTAC TAG 


CTAATGTGAC 


240 




AAGATGTTAA 


GAGAAAATTA 


AAGAAAAAAT 


AACATCTGTC 


ATACAATAAT 


ATTGTTATAC 


300 




T AC T AGAGAC 


TGATTTATTA 


GCATGATTAC 


ATGTTAATGT 


TTCTTTACTT 


AGTAATTAAC 


360 


35 


TTTATAATGT 


AAGAATAATT 


ATCTTCAACC 


AAAGAAAGGG 


ATTGATGATT 


TGTCGTTTCA 


420 


TCAAGTAGAA 


GAATGGTTTG 


AGATATTTCG 


ACAGTTTGGT 


TATTTACCTG 


GATTTATATT 


480 




GTTATATATT 


AGAGCGATAA 


TTCCAGTATT 


TCCTTTAGCA 


CTCTATATTT 


TAATTAACAT 


540 




TCAAGCTTAT 


GGACCTATTT 


TAGGTATATT 


GATTAGTTGG 


CTTGGATTAA 


TTTCTGGAAC 


600 


40 


ATTTACAGTC 


TATTTGATCT 


TGTAAACGAT 


TGGTGAACAC 


TGAGAGGATG 


CAGCGAATTA 


660 


AACAACGTAC 


TGCTGTTCAA 


ACGCTTGATT 


AGTTTTATTG 


ATCGCCAAGG 


ATTAATCCCA 


720 




TTGTTTATTT 


TACTTGGGNT 


TTC CTTTTAC 


GCCAAATACA 


TTAATAAATT 


TTGTAGCGAG 


780 




TCTATCTCAT 


ATTAGACCTA 


AATATTATTT 


CATTGTTTTG 


GCATCATCAA 


AGTTAGTTTC 


840 


45 


AACAATTATT 


TTAGGTTATT 


TAGGTAAGGA 


AATTACTACA 


ATTTTAACGC 


ATCCTTTAAG 


900 




AGGGATATTA 


ATGTTAGTTG 


TGTTGGTTGT 


ATTTTGGATT 


GTTGGAAAAA 


AGTTAGAACA 


960 




GCATTTTATG 


GGATCGAAAA 


AGGAGTGACA 


TCGTGAAAAA 


AGTTGTAAAA 


TATTTGATTT 


1020 




CATTGATACT 


TGCTATTATC 


ATTGTAC TGT 


TCGTACAAAC 


TTTTGTAATA 


GTTGGTCATG 


1080 


50 


TCATTCCGAA 


TAATGATATG 


TCGCCAACCC 


TTAACAAAGG 


GGATCGTGTT 


ATTGTAAATA 


1140 




AAATTAAAGT 


TACATTTAAT 


CAATTGAATA 


ATGGTGATAT 


CATTACATAT 


AGGCGTGGTA 


1200 




ACGAGATATA 


TACTAGTCGA 


ATTATTGCCA 


AACCTGGTCA 


ATCAATGGCG 


TTTCGTCAGG 


1260 




GACAATTATA 


CCGTGATGAC 


CGACCGGTTG 


ACGCATCTTA 


TGCCAAGAAC 


AGAAAAATTA 


1320 


55 


AAGATTTTAG 


TTTGCGCAAT 


TTTAAAGAAT 


TAGATGGAGA 


TATTATACCG 


CCTAACAATT 


1380 




TTGTTGTGCT 


AAATGATCAT 


GATAACAATC 


AGCATGATTC 


TAGACAATTT 


GGTTTAATTG 


1440 



149 



EP 0 841 394 A2 



10 



30 



35 



ATAAAAAGGA 
TTCAGTTCAA 
TTTCAATTGC 
ATACAATTAA 
ACATTATTGG 
ACAAAAATGA 
AAAATGATAC 
TAAAACATAA 
GCGAATCCCA 
GTGAAGTAAG 
AAGTTTCATT 
CTAAAAATTA 
GTTTTATTTG 
TCACAATCAC 
ATGCTTATTT 
AAAAAATGAA 
CATTTCAATT 
AAGTGTTGCA 
AACAAAAGTT 
AAAAGTATTT 
CAGAACAAAT 
TTGCTGATAA 
ACCGTGAGTT 
TATTTTATTG 
GGTTTTCACA 
AAGAGTGTCA 
AAACCATCGG 
GAACGTCAAT 



TATTATTGGT 
ATCTTAAAAA 
AGTCGCTTTT 
AGGTGAATCA 
ATATAAAACA 
TGACTATGTT 
ATTATATGTC 
ACAAGGTGAT 
AATCAAATGT 
TAAAGATAGC 
TAGATTCTGG 
ATATGAAACA 
GTTAGTACCA 
TTTGTGTTAA 
AGGTAGAGCG 
AGCAGATCCG 
AGAACAAGCC 
TTTTGAACGA 
ATCTAAAGCT 
AAAAC TTTAT 
TCAAGATTTT 
AAATATGCAA 
CGAACAACGC 
ATTGTATGCC 
AC TTTTC AAC 
CAATTATATT 
AAGTGTTACG 
ATTTCAACCA 



AATATAAGTT 
GAGGTGTCAA 
GTCATTTTAT 
ATGGATCCAA 
GGTGGTTTGG 
AAACGTGTCA 
AATGGTAAAA 
TACATTACTG 
CAATCCAAAA 
CGTGCGTTTG 
CCATTTAGTG 
AATACAACAT 
CAATAAAAGC 
AATATGTGTC 
GGAACAGGTA 
CTTGGAGATC 
TTTGTCAATG 
TTAAGTCATC 
GCAACGGAAA 
CAATCACAAG 
AAAAAATATG 
AC TCGAACTA 
ATNCCANAAC 
GAAATCAGAG 
GATTGAGTAT 
GACGACAGAT 
ACATATTGAA 
ATTATATCGC 



TGAGATATTA 
AATTGAAAAA 
TTATAGTAGG 
CTTTGAAAGA 
AAAAAGGTAA 
TCGG TGTTCC 
AACAAGATGA 
GGACTTTCCA 
GGGTAAATAT 
GCCTCATTGA 
AATTTAAACA 
CGTTTGTCGG 
TAAGTTCGAA 
AAAGGAAGTG 
AGTCTACGAA 
CAATCATTTT 
ATCCGGAATT 
GTATTTTCCA 
TGATGATTTA 
CAAAATATTA 
CAGTAACGCC 
AAAATAAGTT 
GAGTTTATTA 
TGGCT AAAAC 
TTAATAATCA 
GGTAACCACG 
GAAATAGCAA 
TTCAATAATC 



TCCTTTTTCA 
AGAATTATTG 
TAAATTTATT 
TGGCGAGCGA 
TGTAGTTGTC 
TGGTGATAAA 
ACCATATTTA 
AGTTAAAGAT 
TTAGCTCTTG 
TGAAGACCAA 
TAATTTCAAT 
TTTTAATACT 
ATGAACTTAT 
AGGGTTTGTC 
AATGTTGACC 
AATTGCGCCA 
AAATGGTAGT 
AGAAGTTGGT 
TAACATTGTT 
TGGGTTTAGT 
TGAACATTTA 
AGAGGATATT 
CTGNTGAGGA 
GTGCTGATAG 
AAGGATTAAT 
ATCAATTTAG 
ATGAACTCAA 



AAATGGACGA 
GAATGGATTA 
GTTACACCAT 
GTAGCTGTAA 
TTCCATGCAA 
GTAGAATATA 
AACTATAATT 
TTACCCGAAT 
GAGGATAATC 
ATTGTTGGTA 
CCTGAAAATA 
GATAAACGAT 
AATAAATCAA 
ATGACATTAC 
GAAATAAAAC 
ACTCAAAGTA 
TTAAGAACAG 
AGTTATAGCG 
CAAGAACAAC 
GAAAAATTAA 
GAACACTTTA 
GC TTTAAT AT 
TTCATNACAA 
ATATATTGAT 
TAAATATGCG 
TTTATTTAGA 
TATTTCTATT 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3100 



(2) INFORMATION FOR SEQ ID NO:185: 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 



GGTGGC GAG A AAAT AC CGC A AGGTCATAAA GATATCTTTG ATCCAAACTT ACCAACAGAT 60 
CAAACGGAAA AAGTACCAGG TAAACCAGGA ATCAAGAATC CAGACACAGG AAAAGTGATC 120 
GAAGAGCCAG TGGATGATGT GATTAAACAC GGACCAAAAA CGGGT AC AC C AGAAACAAAA 180 
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5 



10 



ACAGTAGAGA 


TACCGTNTGA 


AACAAAACGT 


GAGTTTAATC 


CAAAATTACA ACCTGGTGAA 


240 


GAGCG AG TG A 


AACAAGAAGG 


ACAACCAGGA 


AGTAAGACAA 


TCACAACACC 


AATCACAGTG 


300 


AACCCATTAA 


CAGGTGAAAA 


AGTTGGCGAG 


GGTCACCCAA 


CAGAAGAGAT 


CACAAAACAN 


360 


CCAGTAGATA 


AGATTGTAGA 


GTTCGGGTGG 


AGAGAAACCA 


AAAGGTCCCA 


AANGGACCTG 


420 


AAAACCCAGA 


GAAGCCGAGC 


AGACCAACTC 


ATCCANGTGG 


GCCAGTAAAT 


CCTAACAATC 


480 


CAGGATTATC 


GANAGACAGA 


GC AAAACCAA 


ATGGCCCAGG 


TCCATTCAAT 


TGGATAAAAA 


540 


TGATAAAGGT 


TAAAAAATCT 


AAAATTGCTA 


AAGAATCAGT 


AGCTAATCAA 


GAGAAAAAAC 


600 


GAGCAGAATT 


ACCAAAAACA 


GGTTTAGAAA 


GCACGCAAAA 


AGGTTTGATC 


TTTAGTAGTA 


660 


TAATTGGAAT 


TGCTGGATTA 


ATGTTATTGG 


CTCGTAGAAG 


AAAGAATTAA 


AATAATTCAT 


720 


AATTTAAATA 


ATAGTTGATT 


TGCATTCACT 


ATATTTAGTT 


TGTTAAAAAC 


AACCTAGAAT 


780 


ATGATGAGAA 


TGATATACAA 


CCCCAAAAGT 


TGGCTTG 






817 



IS 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

30 



35 



40 



45 



SO 



55 



CCATCGACTA 


AG C AATGGTA 


TTTGTTCATT 


AAATCCTCAT 


GTTGATCGTT 


TAACGTTAAG 


60 


CTGTCGCATG 


GAAATCGATG 


CTAGTGGTCG 


C GTTGTT AAA 


CATGAAATTT 


NTGATAGTGT 


120 


TATACATTCT 


GATTATCGAA 


TGACGTATGA 


TGCGGTAAAT 


CAGATTATTA 


CTGAAAAGGA 


180 


TCCTAACATT 


CGCGAACAAT 


ATAAAGAAAT 


TACGCCTATG 


TTAGATTTAG 


CACAAGATTT 


240 


ATCTAATCGN 


TTGATTCAAA 


TGAGAAAACG 


ACGTGGGGTG 


AAATCGATTT 


TGGATATTAG 


300 


NGAAGCAAAA 


GTATTAGTTA 


ACGAAGTCGG 


GTATACCAAC 


AGATGTTCAA 


TTAAGACAAC 


360 


GTGGCGAGGG 


TGAACGTCTA 


ATTGAATCAT 


TTATGTTAAT 


TGCAAATGAA 


ACAGTTGCTG 


420 


AACATTTTAG 


TAAGTTAAAT 


GTAC CTTTT A 


TTTACCGAGT 


GCATGAGCAA 


CCTAAATCAG 


480 


ATCGCTTAAG 


ACAATTCTTT 


GATTTTATTA 


CAAACTTTGG 


CATCATGATT 


AAGGGCACTG 


540 


GCGAAGATAT 


TCATCCAACA 


ACACTTCAAA 


AGGTTCAAGA 


AGAAGTAGAA 


GGTCGACCTG 


600 


AACAAATGGT 


CATTTCAACA 


ATGATGTTAC 


GTTCAATGCA 


ACAAGCGCAT 


TATGATGATG 


660 


TGAACTTGGG 


ACATTGTGGC 


TTATCAGCTG 


AATATTATAC 


GCATTTNACA 


TCACCAATTA 


720 


GACGTTATCC 


TGATTTAACA 


GNTCATCGTT 


TAATCCGTAA 


GTATTTAATT 


GAGAAATCAA 


780 


TGGATAACAA 


AGAAGTGAAG 


CGTTGGGAAG 


ACAAATTGCC 


TGAGTTAGCT 


GAACATACTT 


840 


CTAAACGTGA 


ACGTCGTGCT 


ATTGAGGCAG 


AACGTGATAC 


TGATGAATTG 


AAAAAAGCAG 


900 


AATATATGAT 


TCAACATATT 


GGTGATGAAT 


TTGAAGGTAT 


TGTCAGCTCA 


GTAGCTAACT 


960 


TCGGTATNTT 


CATTGAATTG 


NCAAATACGA 


TAGAAGGTAT 


GGNTCATATT 


GCGAATATGA 


1020 


CTGATGATTA 


TTACCGCGTT 


GAAGAGCGTC 


AAATGGCATT 


AATTGGGTGA 


GCGTCAAGCT 


1080 


AAAGTATTTA 


GAATTGGTGA 


CACAGTTAAG 


GTTAAAGTGA 


CGCATGTTGA 


TGTAGATGAA 


1140 


CGATTAATTG 


ATTTTCAAAT 


TGTTGGAATG 


CCTTTACCTA 


AAAATGACCG 


CTCACAGCGA 


1200 
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CCAGCAAGAG G T AAAACG AT TCAAGCTAAA ACGCGTGGCA AATCTTTAGA TAAATCGAAA 1260 
TCTGATGATA AGGG TCGGT A AGAAAAAAGG TAAGCAACGT AAGGTAAAAA CCAACGTAAT 1320 
AATGATAATC AGGTAATAGT AAGCATAG 1348 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 187: 








ATGAAAATTA 


TTTTAATTTT 


AGCATTAGCG 


CGTGTCGTTT 


CTAGACATAA 


TCAATTCACA 


60 




TTCAATAAAT 


CATTCCAAAG 


TGATTTGTTA 


TTATTTTTCA 


AAATTATTGG 


TGTCTCGTTA 


120 


25 


GTACCAAGTA 


TTTTAATATT 


ACTGCAAAAT 


GACCTAGGAA 


CTACATTAGT 


ATTAGCTGCT 


180 




ATTATTGCAG 


GTGTGATGTT 


AGTAAGTGGT 


ATAACATGGC 


GTATCTTAGC 


ACCTATCTTT 


240 




ATTACAGGTA 


TTGTTGGTGC 


AATGACAGTC 


ATTTTAGGTA 


TTCTATATGC 


ACCCGCATTA 


300 




ATTGAAAATT 


TATTAGGTGT 


CCAACTGTAT 


CAAATGGGAC 


GAATCAATTC 


AAGGCTTGAC 


360 


30 


CCCTATACAT 


ATAGTAGGGG 


GGGGGATGGC 


TATCAATTAA 


CTGAATCACT 


TAAAGCTATC 


420 




GGGCTC TGGA 


CAAGGTACTA 


GGTAAAGGAT 


ACAATCACGG 


GGGAGGTTTA 


TATTCCCTGA 


480 




AAATCATACT 


G AC TTTTATC 


TTTNCAAGNG 


AATGGGAGAG 


GAACTTGGCT 


TTATCGGTTC 


540 




TGGCAAATTG 


AGNCTTAATA 


TTTTTATTTT 


TAATCTTCCA 


TCTAATAAGA 


TTAGCTGCGA 


600 


35 


AAATTGGAGA 


TCAATTTACC 


AAAATCTTTA 


TCGTTGGTTT 


CGTCACTTTA 


CTTGTGTTCC 


660 




ATATTTTACA 


AAATATTGGT 


ATGACAATTC 


AGTTGTTACC 


AATCACTGGT 


ATTCCATTAC 


720 




CATTTATTAG 


TTATGGTGGT 


AGTGCGCTAT 


GGAGTATGAT 


GACTGGAATA 


GGTATAGTCT 


780 




TATCAATTTA 


TTATCATGAA 


CCAAAACGAT 


ATGTCGATTT 


ATACCATCCA 


AAAAGTAATT 


840 


40 


AATTTAAACT 


ATTTTGAGTT 


TCAAATATCA 


TAACTTTTCA 


AGATGACGTT 


ATATAGTCTA 


900 




TTTACGTCGT 


CGATTTAAAA 


TGTCATATAT 


AGATATTACT 


CGATAATAAC 


AATCCCTCTT 


960 




TGAAGTACAC 


ATTGTAAAAT 


GG 








982 



45 (2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1709 base pairs 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 



5 



10 



15 



20 



25 



30 



35 



GGCACTAACT 


TCATTTGAGC 


ATCCTCCTAT 


CAATTGCTAT 


ATAAATTAGT 


ACCCTTTTGC 


60 


CACTTAATTA 


TAACAAATTC 


TCAAATTTTA 


AAAATTGAAA 


ATCTAGTTAA 


TGTATTAGCT 


120 


CGATTTTGAA 


ATCTAATAAT 


AATTGGCATA 


AAATGGAAGT 


AATATTATGT 


GAGGAGTGTT 


180 


ATGAAATGAC 


AAAAATATCA 


AAAATAATAG 


ACGAACTGAA 


CAATCAACAA 


GCTGATGCAG 


240 


CATGGATTAC 


AACACCGTTG 


AATGTATATT 


ATTTTACTGG 


ATACCGTAGC 


GAACCCATGA 


300 


AAGATTATTT 


GCATTATTGA 


TTAAGAAAGA 


TGGTAAACAA 


GGTACTATTT 


TGTCCAAANA 


360 


TTGGAAGTCG 


GAAG AAA TC A 


AAGCATCACC 


CTTCACAGGT 


TGAAATCGTT 


GGATATTTAA 


420 


GACACTTGAA 


ATCCTTTTTC 


ACTTTATCCA 


CAACNCAATC 


AATAAATTAC 


CTAATTGTAA 


480 


GCGGAGCACC 


TTAACAAGTA 


GCACCGCCAC 


AAACAATTAA 


TCTCTGTTTC 


NATGTCAATT 


540 


CATTCGGAGA 


TGTTGAGTTA 


ACAATCACAC 


AATTAAGAGA 


TATTAAATCC 


GAAGATGAAA 


600 


TTAGCACAAT 


AC GT AAAGCT 


GCTGAGTTAG 


CAGATAAGTG 


TATCGAAATA 


GGTGTTTCTT 


660 


ATTTAAAAGA 


AGGTGTGACT 


GNACGTGAAG 


TAGTCAACCA 


TATTGAGCAA 


ACTATCANAC 


720 


AATATGGCGT 


CAATGAAATG 


AGTTTTGATA 


CGATGGTTTT 


ATTTGGAGAT 


CATNCCGCAT 


780 


CACCTCATGG 


CACACCAGGA 


GATCGCAGAT 


TAAAAAGCAA 


TGAATATGTA 


CTATTTGATT 


840 


TAGGTGTAAT 


TTATGAGCAT 


TATTGTAGCG 


ATATGACACG 


TACCATTAAA 


TTTTGGTGAA 


900 


CCTAGCAAAG 


AAGCACNAGA 


AATTTATAAT 


ATTGTATTAG 


AAGCAGAAAC 


ATCTGCAATC 


960 


CAAGCAATTA 


AACCTGGAAT 


ACCTTTAAAA 


GATATCGATC 


ATATCGCTAG 


AAATATTATT 


1020 


TCAGANNAAG 


GTTATGGTGA 


ATATTTCCCT 


CATCGCTTAG 


GTCATGGTTT 


AGG ATT AC AA 


1080 


GAACATGAAT 


ATCAAGATGT 


TTCAAGTACT 


AATTCTAATT 


TGTTAGAAGC 


TGGCATGGTT 


1140 


ATTACAATCG 


AACCAGGTAT 


TTATGT AC C A 


GGTGTTGCAG 


GTGTAAGAAT 


TGAAGATGAC 


1200 


ATACTTGTCA 


CTAATGAAGG 


ATATGAAGTA 


TTAACACATT 


ACGAAAAATA 


AGGAGTGGGA 


1260 


TAAAAATGAA 


AAGCTTGTTA 


CAAGCACATT 


CTCATTCAGT 


CAAACACTGC 


CAATATAACA 


1320 


TTGTAGCGCC 


TAAGACATAA 


ATTTTTATCC 


AAGTCTAAAT 


GC AATATGTA 


ACAAACAAGC 


1380 


TAGAAACACA 


TATGCAGGTA 


TGTTCATCAG 


TAACATGTAA 


TGAATCAAAT 


CAATATCATT 


1440 


CATGTTCGAT 


GATTTCTTCG 


CATTGTTTCT 


AGCTTTAATT 


TATCATTATT 


TAATTTTAAT 


1500 


AACCAAGGAG 


ATGATAACGT 


CAGTCTTTAG 


TACGC TGTAA 


TCCATTCCCT 


TTTCATCAAA 


1560 


TTCAAATTAT 


AATTGTAATG 


CTTCTTCTAC 


AGATTTATAT 


TCCATTTCAA 


ATGCCTCTGC 


1620 


AACGCCTTTA 


TTGGTTACGT 


GACCTTTGTA 


AGTATTTAAA 


CCTAATGATA 


ATGGTTGATT 


1680 


TGATTTAAAT 


GCTTCTCTAT 


ACCCTTTAG 








1709 



(2) INFORMATION FOR SEQ ID NO: 189: 

45 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
so (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



153 



EP 0 841 394 A2 



5 



TO 



15 



20 



25 



AAAGAGGAAG 


ATGTGAACCA 


CCACTAGTAT 


TG AATC T AAA 


GATTGAGGAC 


GTTCCTTTAT 


60 


T AATTGGC T A 


NNNAAGNNAN 


ATCNNNAAGA 


TAGCCAATCC 


AAAGCCAAAA 


AGTCAGCATC 


120 


AAGAAATACT 


TCTANNAAGG 


TAGCAGCTNN 


NNAGAAGNGN 


NAGAAATCTA 


AGANAAATNN 


180 


NAAATAATTT 


GTTTCTTTGC 


TAAATAGAGG 


AGC AC CGATT 


GACATCACAT 


CAGTCGGTGC 


240 


TCCTTTTATT 


TATTCTTTTT 


AATTAATTTA 


TACAAANCCN 


ATTCCCTGTT 


GAGCGTGTTG 


300 


AATCGCCTTC 


CNTTGTTTTG 


TTCTCNCGGT 


ANCCCATTTA 


ACCAAATNAT 


AAAC CNANAT 


360 


CTTTNTCCAA 


ATATTTCTAT 


TTGATCAAAA 


TAAGGTTTGA 


AATTTGCGTT 


TTTC AC AT AA 


420 


CCAGCTCGTG 


CCAATGCTAT 


CGTGCAATTA 


GCTTTGAGTC 


TGTATATAAT 


AGTGCGTTTT 


480 


GAACATTTAA 


TTCACGTGCA 


TGTTCTAGTG 


CATAAATACA 


TGCAGCCCAT 


TCTGCAGTGT 


540 


GGTTATCCAT 


TTCGCCTAAC 


TCATGTGTAT 


ATGTATNATG 


CTGCTTATCT 


TCTTTGATTA 


600 


CAATGGCACA 


NGTACTTATG 


CCTGGATTTC 


CTNTTCGTCG 


CAGCATCAAA 


ATTTATGTGC 


660 


GCCATAATAA 


ACCTACTTTC 


TATTCAATAC 


TTAGTTAAAG 


TTACTATTAC 


TGTAATACAA 


720 


AATATGTTGG 


GTAATCCATT 


AAAAAACACG 


CATCACTTAA 


ATAAGTAAAC 


ACGTTGTTAA 


780 


AAATACTTCG 


CTTGATTCAA 


AAGATGATTT 


TCTAAATACG 


TAGTNCTTGT 


AAAAT AC TTC 


840 


CTAAANAAAT 


CATCTTCAGG 


CTGGGGACAT 


AAATCAATGT 


TCTATGCTCC 


TNCCGAAGTT 


900 


ATATTGGCAG 


TAGTTGACTG 


ANCGAAAATG 


CGCTTGTAAC 


AAGCTTTTTT 


CAATTCTAGT 


960 


CAGGGGCCCC 


AACACAGAAG 


CTGNCGAAAA 


GTCAGCTGAC 


AATAATGTGC 


AAGTTGGGGA 


1020 


TGG AC CC CAN 


CAAAGAGAAA 


TTGTATTCCC 


AAATTCTACA 


GACNATNCAA 


GTTGGGGTGG 


1080 


GNCGACGAAA 


TAAATTTTGC 


GAAAATATCA 


TTTCTGTCCC 


ACTCCCTTAA 


AACTTATTCT 


1140 


TTTGTGTAGT 


AAGTGCGTTA 


ATAGCCTTGA 


TC TAACTT AT 


CAATCTTACC 


TTTACGATAA 


1200 


AATGATTTAG 


CAATATATCC 


ANNTGGTACA 


TTGAAAACTG 


TTGAAGCTAA 


TTTTAATACG 


1260 


TAAGTTGTAA 


TAAATATTTC 


NAATNCAACT 


GTACCAGGTA 


AACTTCCGAT 


AAAAGCGATA 


1320 


GCTACAAATA 


AAGCTGTATC 


CAATTATTGA 


GCTTG 






1355 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



45 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 190: 








CCAATTTTCC 


CTTCTACCTG 


CGTGGGTAGT 


ATTCAAGCCC 


AAAGTTCAGC 


GGG AAAATAT 


60 




TGACAATTAA 


CCGGATCCAG 


CTAATCGGAG 


GTCGGATAAA 


GATGTTGAGT 


TGTAAGTTAT 


120 


50 


TTGGAATATT 


ATTTTAATAG 


TGTCATCCCC 


CTTTGTAAAA 


TAATTGTCTT 


ACTTTTAAAT 


180 




TAAAAGCCAA 


ATTAATATAA 


GAAANCTAAG 


ACTTAGTACN 


GTATCAATTT 


TGTGCGTTTC 


240 




AATTGAG TTC 


TAGTTTTTTT 


TAATATGTTA 


ATATTAAACT 


TATAACTTTA 


TGGGAGTGGG 


300 




ACAAGAATGA 


TAAAGAGCCA 


CTAAATGATT 


T ATT ATG TAG 


TGGTTCTTAA 


T AC ATT AGCC 


360 


55 


ACANCTAATG 


TGTACTTAAA 


AATAGGAATA 


CATGAGTAAA 


ACTCATGCAT 


AAGAAATACT 


420 




AATTTCTATA 


GAAAAAGTAT 


TACTTTATCG 


TTGTACCACA 


CCAACTTGCA 


CATTATCGTA 


430 
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5 



10 



15 



20 



AGCTGACTTA 


TCGTAAGCTT 


CTG TGTTGGG 


GCCCACACCC 


CAACTCGCAT TGCCTGTAGA 


540 


ATTTCTTTTC 


GAAATTCTCT 


TTGTTGGGGC 


CCACACCCCA 


ACTTGCATTG 


TCTGAAGAAA 


600 


TTGGAAATCC 


AATTTNCTCT 


GTGTTGGGGC 


CCACACCCCA 


ACTCGCATTG 


CCTGGAAGCT 


660 


GAATTTCTTT 


TCGAAATTCA 


GCTTCTGTGT 


TGGGGCCCAC 


ACCCCAACTT 


GCATTGCCTG 


720 


TAGAAATTCT 


TTTCGAAATC 


CAATTTCTCT 


GTGTTGGGGC 


CC CTG AC TAG 


GATTGAAAAA 


780 


AGCTTGTTAC 


AAGCGCATTT 


TCGTTCAGTC 


AACTACTGCC 


AATATAACTT 


CGTAGAGCAT 


840 


AGAACATTGA 


TTTATGTCCC 


AGCCTGATAT 


CACCATTAAA 


TACAATTCAT 


TTAGTNTTCA 


900 


ATTGGAAACA 


ATTNATCGAT 


ATATTGAATC 


TCATCATCTG 


ATAAAACGAT 


ATCTGCAGCT . 


960 


TTAATATTTT 


CAATGACTTG 


TTCTGCACGT 


TTTGCACCAG 


GAATAATCAC 


ATCGATAGCT 


1020 


GGTCTCGTTA 


AATAAAATGC 


TAATACAATG 


TTCGCAATTG 


AAGTTTGATG 


TGCTGCAGNT 


1080 


AGNCTTTCCC 


AAAGCTTTTA 


CGCTCGTGAC 


GCACATTCTC 


TTCAAAAACA 


CCAGGTATAA 


1140 


AATCCCGACG 


TGTAGTACGA 


TGGTCACTAA 


ATTTAGTGTT 


CTCATCATAT 


TTTCCAGCTA 


1200 


AAATACCGGA 


TGCTAATGGG 


AAATAAGGAA 


TAAATGTGAT 


TAGGTGATCA 


ACACAATATT 


1260 


GCAATACTGC 


CTCATTTTCG 


CGGTGCAATA 


AATTATATTC 


TAACTGTACA 


ACATCAACGT 


1320 


AACCATCTTT 


ATTTGCTTCT 


TTAAGTTGAT 


CTAATGTGAA 


ATTTGATACA 


CCAATGGCTT 


1380 


TAATCTTCCC 


TTGTTCCTTA 


AGCTCGTGCC 








1410 



(2) INFORMATION FOR SEQ ID NO: 191: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

30 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



40 



45 



50 



55 



GTGTTACGGT 


AGGTTGTTGA 


TCCTAATTTT 


CCCTGNCAGT 


TGGTCCCAGT 


ACCCACCCAT 


60 


TTCCCAACTT 


ACTAAAAAAG 


AAATCCAATT 


ANNCCGAAAG 


AAAATATTGA 


AGTCATTGAA 


120 


ATTAACCGAA GCGTTCCAGT 


GCACAGGTAG 


TTGCCTCCCC 


AACCAAGCTT 


TAAATATTTC 


180 


CAAATACGCA 


ATTAAATATA 


TGGGG TGGTG 


CATTAGCATC 


AGGTCATCCA 


TACGGTGCAA 


240 


GCGGTGCCCA 


ATTAGTGACT 


CGATTATTTT 


ATATGTTTGA 


CAAAGAGACT 


ATGATTGCAT 


300 


CTATGGGGAT 


AGGGGGAGGT 


CTAGGAAATG 


CAGCATTATT 


TACTCGATTC 


TAACCAGCGA 


360 


TTAAATGTGT 


CATTTTCTAA 


GGATAGTGTG 


GCTGCATATT 


ATCAGTGTTT 


TANCCAACCT 


420 


TATAGAAAAG 


AAGTCTCGTG 


CCCATTAATG 


TGTNCGTCAT 


TATGGCCACA 


ATTTGATTTA 


480 


TTTAAAAAAA 


TATCCAAATA 


GCGAGCTGAT 


TTTACCACAA 


ATCAGCAATT 


AATCAAACTC 


540 


NCAAGATAGA 


AGTAGACACA 


ATATATGTAG 


GGCATTTAGA 


AGATATTGAA 


TGCCGACAGA 


600 


CTCGCAATAT 


CACACGTTAT 


AC AATGGC TT 


TAACATTAAC 


TAAAAATGAT 


CAACATGTCA 


660 


TANCGGTTAC 


ACAAACTTTT 


ATTAAGGCGA 


TGAAGTAGAG 


ATGAAGTTTA 


ATGAGATATG 


720 


GATAAATGAA 


TATTTGGCGC 


TCGTAAATGA 


TGATAATCC A 


ATACATAATG 


AGATTGTGCC 


780 


AGGACAATTA 


GTGAGTCAAA 


TGATGCTGAT 


GGCTATGTCA 


TTAGAGACAA 


ACCAGTGTCA 


840 


AATTAACTAC 


GTTAAACCTA 


TTTTAATAAA 


TGAAAATATC 


GAATTCATTG 


AACAACACGA 


900 
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5 



10 



15 



20 



25 



30 



ACACGAAATT 


ATAGCAATTA 


ATGACGATGG 


AGAGATTAAA 


ATAAAAATTT 


CTTTGAGCAC 


960 


AAAAAAATAA 


C CG AT ATT AG 


CTGCATGAAC 


GCATATTAAT 


TAGGAGATGA 


AAGGACAGCT 


1020 


AATATCAGTT 


ATGTATTGTT 


ATTATTATTG 


GGAACAGAGA 


TGAATATAGG 


TTACGTTTCT 


1080 


TTCTTTGCAC 


GGGGATGCAT 


TAATCTAAAA 


TAATAATAAC 


AACTATATCA 


ATGTTTAATA 


1140 


AAT TCTGGAT 


TATTGGAACG 


ATTAGTCAAT 


TTAACTAACT 


TNC AT ATGAT 


CTATATCGTC 


1200 


TTGTNATAAA 


GAGAGCAATT 


TGAATATTTC 


AGTATCACTA 


AATGAATCGN 


CACATTTAAT 


1260 


TGAAACATGC 


TGAAACGTTT 


GGGTTATAAT 


TTCATAAACT 


GGTGCGCCTT 


CATGGTGATA 


1320 


CTGTCGAATA 


AATAATCATA 


ACCATATTTA 


CCTCCTTNGG 


CTACTCTATG 


GG TAT ATT AT 


1380 


AAATAACATT 


TTTATGTGTG 


ACATCAACCT 


TAAGTATCAA 


CTTTTTATCA 


GACATAGAAC 


1440 


GTANGATTTA 


CTAAGACTAT 


TTATGTATAA 


AAGTTCTAAA 


TAAATATATA 


TTTATAGAGT 


1500 


CGCCTGGCAG 


NCATTTGGGA 


AATATAACAT 


ATATGATTAG 


AGAGGCATCT 


ATCGCAAAAG 


1560 


AATGATAATG 


ATAGAGGTAT 


TGAGCATATA 


GATGAGTTTA 


AGTTCATCTT 


GAAAATAAAG 


1620 


GGTTATTTAG 


TCATAGATGT 


AGATGTATAG 


GAAATATTTG 


TATGTATTGN 


TCGATATGTA 


1680 


TGAAATTTTC 


AATAAAAGCT 


AATAACGCTT 


ATATGTAACT 


TTCAAATTTA 


AATTATATAC 


1740 


AGAGCATGAT 


GATTATAAAA 


AAATANCCAC 


ATCACATAAA 


TTGAGTTCAT 


ACCCAATTTA 


1800 


AGTGGTGTGG 


CTAATAATGT 


TGATTTATAG 


ATGAACCGCC 


TAATCGTTAA 


ACCTCTGTTA 


1860 


CTTCAACATC 


GATATGTTCA 


ATACGGTTGT 


ATGCACCGTG 


ATCCACAGGA 


CCAACAAAAT 


1920 


CATTCATTTT 


CCAACCGTTT 


TTAATAGCAG 


AAGCGACGAA 


AGCTTTCGCT 


CGTGCTAATC 


1980 


AC AGCTTC TT 


TCGGTGACTT 


ACCGTTAGCT 


AAATATGCAG 


GTGTTGCCGC 


AGCAAATGTA 


2040 


CAACCAGCAC 


CATGGTTATA 


ACTTTGTTGG 


AACATGTCTG 


TTGTTAGTTG 


ATAAAATGTG 


2100 


TGACCATCAT 


AGTATAAGTC 


ATACGATTTA 


TCTTGATCTA 


AAGCTNTGNC 


ACCTTTAATG 


2160 


ATGACATGCT 


GTGCGCCTTT 


ATCAAAGATA 


ATTGNTGCAG 


CCTTTNACAT 


ATCTTCAATT 


2220 


GAATTTAATT 


TACCTAATCC 


TGATAATTGA 


CCCGCTTCAA 


ATAAGTNTGG 


TGTCACTACC 


2280 


GGTGGTTTAG 


GTAGTAAATA 


TTTAATCATC 


GCCTCAGTAT 


TTCCAGGATT 


AAGCACTTCA 


2340 


TCTTCGGCCT 


TTACAANCCA 


TGACAGGATC 


TTACTACAAA 


ATATTGTGCA 


ATTAGATGCT 


2400 


CATATACTTC 


TCCAGNACGG 


GTGGATATCT 


CCTCAAG 






2437 



35 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

SO 



TTTCCCCCNA 


CTCCATAATC 


TTGGCCAAAA 


TCATGTTTAA 


ATTTGATCTT 


GATTTGGACC 


60 


CAAAGATAGC 


TTGCATATTG 


TTTCCCAACT 


TCTAANACAC 


CTGATGCGCC 


TAAAGCTTTA 


120 


AATACCTGCT 


ACATCTACTT 


TTGATTTATC 


CAACCCACTT 


CTACGCGCAG 


ACGTGTAATA 


180 


CATGCATCTA 


AATGTTTAAT 


GTTTTCTTTT 


CCACCCCATT 


GNATCTAAGA 


CATCANATGG 


240 


TAATTTTGCG 


ACACTAGAGT 


TACGAATTTC 


AGTTTCTTCA TCTTCACGAC 


CTGGTGTTTT 


300 
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CAATTTAAAC TTACGAATTN CACAGTCGAA TGAGANGTAA NACACGATAG CAGACACGAC 3 60 

AGCGTTCTNN GACTGGTAAT NGGGCATTNN GCCCAACCCA ATTTATGTGA GCTAACTCAG 420 
GTTGAACNTT GACCATCTTT NNATTCTTCA ACCCANAACC TGCAGGCGTA TTCATAANT 479 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 





CCAAACTGTT 


CTGGTCTAAG 


ATATTATCGC 


TAATCCGTGT 


TAAGCCACTC 


AGCTGTTTAA 


60 




TATCAATTAC 


CTTTTTNAAA 


TCCCTCCAAC 


CTAAAATCTT 


GATTCCCACC 


TTGCTTTAAT 


120 


25 


AAAATCTTTC 


GTTTTAGTTG 


TATTATGTTT 


AAACTTCTTG 


CCAGTCTTTT 


CGGTAAGATA 


180 


ATCAATAATC 


TCTTTATATG 


GGATGCGTGT 


CGGGTTTCCC 


GACAATATAT 


CTACTCTATT 


240 




TATATTGTTA 


TTACTTGTAT 


TATTAATACT 


TGTATTGTCT 


CTTTAACATT 


TGTGATAATA 


300 




GGGGTATTAA 


CAGAATTGTT 


AATAGGGGTA 


TTATCATTTG 


TGTTAATAGG 


TCTTATCATT 


360 


30 


TCTGTTAAGG 


GGTATAGCTT 


TCTTTGTTTA 


ATTTCATTAC 


CATTTCTAAT 


GATTTCAACA 


420 




TGTAAATATC 


CACATTCTTT 


TAAGTTGGCT 


ATACGGCGTG 


ATACAGTAAC 


TTTTGTAACT 


480 




TCATATAGTT 


TCGCAAAGTA 


ACCATTACTT 


GCTGTGCAGT 


ATCCGTACTT 


GTTACTTAAA 


540 




GACGTTATTT 


CTGCAAAAAG 


TAACTTTTCG 


CTGTCAGTAA 


GTCGGTTATC 


GTATCTGACA 


600 


35 


TTTGCCGTTA 


TTATTGAGTA 


GTAACTTGGT 


TGTTCAGTCA 


TTCTCAGCAC 


CTTCTTTCAG 


660 




TGCTTTGAAC 


TTGTCTGGTA 


TCTCCCAGTT 


AGATATGAAT 


TCTTTCAATT 


CATCAGTCAT 


720 




AGGTACTTCG 


TTTAG TATTG 


CGTCATCACC 


AAACAGATAT 


AAAACTATCT 


TGTTATATGC 


780 




TAATGCCGCT 


TTTTC TGTGC 


TATCAAAATA 


TTCCGTAGTA 


TATAGCTTGC 


CATCAATTTT 


840 


40 


TTTCCTAACA 


GAAAATCGAT 


GCGGCATATC 


AG AG TGG TTT 


CCAATTCCTC 


TGTAGCCATA 


900 




TTTACTAGCG 


TTTCTTGTGT 


TGTTAATATT 


TTTATGAGGA 


AAATAATCTC 


TAACTTTACG 


960 




CATATCTCTT 


CCAATTATGT 


TTTTAAAACC 


TTCCACCACC 


CCAAAACTCA 


TCTACAGCTT 


1020 




NGTTATACGC 


TTCAGCTGCT 


AATTCTTCTT 


GTTTNAAAAA 


TACCTAAACG 


TTTTNGGATT 


1080 


45 


TCCTTCAACA 


TTTATATTGG 


CTGTCCATTT 


ATTGCAATCT 


TNACGCCAGT 


TAACACCTTT 


1140 




ANACTTAGAA 


GAACCATTAC 


TTTTAGGTNT 


CTCCCATCTT 


GANCNGTTNN 


NACC TTTTGT 


1200 




AGTTAAATTA 


GATTTAGTAA 


AATTATTATT 


TTTAATTTTT 


TGAAAACTTC 


CTTCTTTTAT 


1260 




AAAGGTTGTT 


AATAGTACAA 


CCTTTCTCTC 


GTTGCCAATC 


CTAGTATGGA 


TATATCTTGT 


1320 


50 


ATTACCTTTA 


TAGTATTTGA 


ACCATTTATA 


TCTGTTCCAC 


TCTTTCATAA 


TCTTCATCAT 


1380 




CAACAAAAAT 


TTCTTCTCCA 


TCTTGTAAAA 


ATATCGATTT 


AACCATTATT 


CTCTTCCTTT 


1440 




CAGCATTTTG 


TTGAGCCTCT 


CATCAACTTT 


TATCCACGAG 


TCATGCAAGT 


GATATTTATC 


1500 




ATCAAACGAC 


TTAACACCAA 


TCGCATGTTG 


CTCGTTGTGA 


TGTTCGCGAC 


ATAACGCTAA 


1560 


55 


TACATGTTTG 


TTGTAGTGAT 


TCATTTTGTT 


TCTGTTCATT 


CCTCTGCCGA 


CTGCTTCATA 


1620 




ATGCGCTAGG 


TCTGCGTGAG 


GCTTTCCACA 


AATTACACAG 


TTGCGGTTGA 


TTGTAGCCCA 


1680 
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ATACAATAGT 


GCTTTATCCT 


CACTTAACAA 


CTTGCTCTCG 


TGCCTATGCT 


CATAGGTAT.T 


1740 




TGATGATGAA 


ACATAAACGC 


TATAATCAGT 


TCTATTAACT 


CCCTTGCAAC 


TTTCATAGAA 


1800 


5 


CAGTCGCGCA 


GACTGATTTC 


TTCATAACCT 


TTCATAATTT 


CCAATTCTGT 


TTGTAATAAT 


1860 




TTNCTAATTG 


ATTCCACCGG 


TTCTCCCCAG 


TGAAGTTCTA 


TATCTCTACA 


CATNGCGAAT 


1920 




ATTTTTTTGC 


GTTGTTC TAT 


AGATAGTTTT 


TTATTATCCG 


GAACCTCTAC 


TTCTGCTTTT 


1980 




AGTGGATATC 


CGTTTTC T AG 


TAAGTCAATG 


TGACTTTGTT 


CAAGTTCAAC 


ACCANTAGCA 


2 040 


10 


ACGACGGAAT 


AAGTNCCGTC 


ATTGTCNNTC 


TGGNATCTTG 


TAATGTATTG 


CANNTAAACC 


2100 




CACACCTTAA 


ACGCTAAATC 


TTGGTCGTCA 


TATCCAAATT 


GCCCCCTGCT 


TTCAAATGGA 


2160 




TTGCT TTGTT 


GAGACATTGA 


TGTTTGTTGT 


TGTGCCCCGT 


TATTTTCTTC 


AGCTTTTTGC 


2220 




TTATCTGTCT 


TCGGAATAGG 


TTTGTTAACA 


ACATCATCGC 


CCTTTTTGTA 


AGGTTTAATA 


2280 


15 


AATGAAAAAT 


CCGTAAAATA 


CTTACCTTCA 


TCTTCATTGA 


ATTTCCATTT 


CAATACCAAG 


2340 


TGACANAACT 


TACCAATAAG 


ATCATTGGTA 


TCANAATCTA 


AGCTAGGAAG 


ATTTAACTTA 


2400 




ATACCTAATC 


GAGTAACTAA 


TTCAATCAAT 


TGTTTTTCTT 


GGAAATCATA 


TTTATACGGC 


2460 




GGTACAAATT 


GATTATGTTT 


ATATTGTTTG 


CCTTCATCAT 


TTTCAAATAC 


GATTGTGAAA 


2520 


20 


TATCTATTTT 


CTCTATCATT 


AGAAGAAAAC 


CTGATCATCT 


TTTTTAGGTT 


TCTTAGATTG 


2580 




GAATTGTTGA 


GGG 










2593 



(2) INFORMATION FOR SEQ ID NO: 194: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3191 base pairs 

(B) TYPE: nucleic acid 
3Q (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 194: 





CCCCAAGTCT 


CATATGAAGT 


AGTTGGATTA 


GTTCCGAAGT 


ATGAGGATGG 


GAATTTCTTG 


60 


40 


ACATAGTAAA 


GTGCCGGATG 


ATAGTGCAAG 


AGGTATCGTT 


GCACAAAATA 


TGACAACTGC 


120 


TGAGATTGAA 


ACATTTGGTT 


CAGATCCAGT 


TATTATGCCA 


ACGGGGTGCC 


CCTGGTATTA 


180 




TTTTCGGTAA 


AACAACAAAC 


TCAATGATTA 


ATACAGGATC 


AGCGGCTTCC 


ATTGTTTACC 


240 




AACAAGGCGC 


TATTTATGCT 


AATGCTGAGT 


TCATTCAAAT 


TCATCCTACT 


GCAATCCCTG 


300 


45 


GTGATGATAA 


ACTCCGACTA 


ATGAGTGATC 


AGCACCNTTG 


TTGAAGGTGG 


ACGAATTTGG 


360 


ACATATAAAG 


ATGGTAAGCC 


TTGGTACTTC 


TNAAGAAGAG 


AAATATCCTG 


ATTATGGTAA 


420 




CTTAGTACCT 


CGTGATATCG 


CAACCGCGTG 


ANATTTTCGA 


TGTATGTATT 


AACCAACAAA 


480 




TTAGGNATAA 


ATGNCGAAAA 


CATGGNATAT 


CTTGATTTGT 


CACATAAAGA 


TCCAACATGA 


540 


50 


GTTAGATGTA 


AAACTAGGTG 


GTATCATTGA 


AATTTATGAN 


AAATTCACTG 


GTGATGACCC 


600 




ACGCAAAGTA 


CCAATGAAGA 


TTTTCCCAGC 


TGTTCACTAT 


TCAATGGGTG 


GTCTATATGT 


660 




AGATTATGAT 


CAAATGACAA 


ATATTAAAGG 


GTTATTTGCA 


GC TGGAGAAT 


GTGACTTCTC 


720 




TCAACATGGT 


GGTAACCGCT 


TAGGTGCCAA 


TTCATTGTTA 


TCAGCGATTT 


ATGGTGGTAC 


780 


55 


AGTAGCAGGT 


CCAAACGCGA 


TTGATTAATA 


TTTCAAATAT 


TGATCCGATC 


CAT AT AC TGA 


840 




TATGGACNGA 


AGTATTTTTG 


AAAAACGTAA 


AGCTGAAGAG 


CAAGGAACGT 


TTTGATAAAT 


900 
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TATTAGCTAT 


GCGCGGTACA 


AGAAAATGCC 


TATAAATTAC 


ACCGTGAACT 


TGGTGAAATT 


960 




ATGACACCAA 


ATGTAACTGT 


TGTTCGTGAA 


AATGAAAAAC 


TGTTAGAAAC 


AGATNAAAAG 


1020 


5 


ATTGTTGAAT 


TGATGAAACG 


TTATGAAGAT 


ATTGATATGG 


AAGATACTCA 


AACTTGGAGT 


1080 




AACCAAGCGG 


TATTCTTTAC 


CCGTCAACTA 


TGGAACATGT 


TAGTAC TTGC 


ACGTG TTATT 


1140 




ACGATTGGTG 


CATATAACCG 


TAACGAATCA 


CGCGGTGCCC 


ATTATAAACC 


AGAATTCCTG 


1200 




ATAAGTATTT 


TCGTCAGAAG 


TACATAATAT 


TTAAATCATT 


TAA TTTTG AG 


AAGCATGGAA 


1260 


10 


AATTTTGGAA 


CAAATGGTTT 


TACGTAAGAA 


AATGGAAACA 


TAAGATTTTA 


GATGGTCATC 


1320 




AGCTTAATCA 


AAAT AT A TAT 


GATCAGCGTC 


ATTTAATGAC 


AATCAATACT 


GATGAAATTG 


1380 




AAAAAATGAT 


TATAGAGACA 


AAG AGGGC AG 


AGTTGATTCA 


TTGGATATCG 


ATACTTCCAG 


1440 




TCATCATATT 


CAATAAAGGC 


TCTCGTTTAG 


TAAAGTATAT 


AAATATTTTC 


TATGCAATGA 


1500 


15 


TAGCTAATGT 


TCCAATCATT 


ATTGTGCAAC 


GCTATAATCG 


ACGAGATTAA 


CGCAGGTACT 


1560 




ACGCATTTTA 


AAACGAAGAG 


GTGAAC GTC A 


TGACTAAACA 


TATCATCCGN 


TATTGGGTGG 


1620 




GNGGCTTAGG 


TGGGATTTCN 


TGCAGCAATT 


CGAATGGACA 


AAAGTGGCTA 


TTC GGNCTC A 


1680 




TTATATGAAC 


AAAATACTCA 


TATAGGAGGC 


AAAGTGAATC 


GNCATGAATC 


AGATGGCTTT 


1740 


20 


GGCTTTGATT 


TAGGTCCATC 


TATTTTAACG 


ATGCCTTATA 


TTTGTGAAAA 


ATTATTCGAA 


1800 




TATAGCAAGA 


AGCAAATGTC 


AGACTACGTT 


ACAATCAAGC 


GTTTGNCACA 


TCAATGGCGT 


1860 




AGCTTTTTTC 


CAGATGGCAC 


GACTATCGAT 


TTGTATGAAG 


GTATTAAAGA 


AACAGGTCAG 


1920 




CATAATGCGA 


TATTGTCGAA 


ACAGGATATA 


GAGGAACTGC 


AAAATTATTT 


GAATTATACA 


1980 


25 


AGACGAATCG 


ATCGTATTAC 


TGAAAAAGGG 


TATTTTAACT 


ATGGTTTAGA 


TACACTATCT 


2040 




CAAATTATTA 


AATTTCATGG 


GC CATT AAAT 


GCTCTTATTA 


ATTATGATTA 


TGT AC AT AC T 


2100 




ATGCAACAGG 


CCATAGACAA 


GCGTATCTCG 


AATCC AT AC T 


TGCGACAAAT 


GTTAGGCTAT 


2160 




TTTATCAAAT 


ATGTAGGTTC 


TTCATCATAC 


G A TGCGNC AG 


CTGTATTATC 


TATGTTATTC 


2220 


30 


CATATGCAAC 


AAGAGCAAGG 


CCNTTGNTAT 


GTAGAAGGTG 


GAATCCATCA 


TTTNGCCAAT 


2280 


GCCTTGGAAA 


AGCTAGCGCG 


TGAAGAAGGT 


GTCACAATTC 


ATACAGGTGC 


ACGTGTGGAC 


2340 




AATATTAAAA 


CATATCAAAG 


ACGTG TGACG 


GGTGTCAGAT 


TAGATACAGG 


TGAGTTTGTA 


2400 




AAGGCAGATT 


ATATTATTTC 


AAATATGGAA 


GTCATACCTA 


CTTATAAATA 


TTTAATTCAC 


2460 


or 


CTTGGATACT 


CAACGATTAA 


ACAAATTAGA 


GAGGGAATTT 


GAGCCGGCAA 


GCTCAGGATA 


2520 


TGTGATGCAT 


TTAGGTGTTG 


CTTGCCAATA 


CCCGCAATTA 


GCACATCATA 


ATTTCTTTTT 


2580 




TACGGAAAAT 


GCTTATCTCA 


ATTATCAACA 


AGTTTTTCAT 


GAAAAGGTAT 


TGCCAGATGA 


2640 




TCCGACCATT 


TATCTAGTAA 


ATACGAATAA 


AACTGATCAC 


ACACAAGCGC 


CAGTAGGGTT 


2700 


40 


ATGAAAATAT 


CAAAGTCTTA 


CCACATATTC 


CATATATTCA 


AGATCAGCCT 


TTTACCACTG 


2760 




AAGATTATGC 


GAAGTTTAGG 


GATAAAATTT 


TGGATAAATT 


AGAAAAAATG 


GGACTTACTG 


2820 




AT TT AAG AAA 


ACACATTATT 


TATGAAGATG 


TTTGGACACC 


GGAGGATATT 


GAAAAAAATT 


2880 




ATCGNTCTAA 


TCGTGGTGCA 


ATATATGGTG 


TTGTCGCAGA 


TAAAAAGAAA 


AACAAAGGAT 


2940 


45 


TTACCTTTCC 


TAAAGAAAGT 


CAGTATTTTG 


AAAACTTGTA 


CTTTGTAGGT 


GGATCAGNAA 


3000 




ATCCTGGTGG 


TGGCATGCCA 


ATGGTTACAT 


TAAGTGGGCA 


ACAANTCGCA 


GACAAANTNA 


3060 




ACGCGCGAAG 


AAGCGAANGA 


ATAGGANGGT 


GANATCTATT 


GAAATCGGTN 


NTCACGACTA 


3120 




TTTAANAGAC 


AATAGTGGNN 


ACCCATCNTC 


TTNTGGGGNC 


TTGGGAGCAT 


TTGGATNTTC 


3180 


SO 


NNCGGGGGNC 


A 










3191 



(2) INFORMATION FOR SEQ ID NO: 195: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1189 base pairs 
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_{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



CCTGGAATAA 


TTGG TGGTCC 


GTAATTGATG 


GGCCAAGTTG 


AAATTGGAAG 


CTAAAAAAGT 


60 


CTTAATCCGC 


CCAAATTGTT 


AACAACACAC 


CTATTGACTT 


TGTCAATCCA 


ACCCATCAAT 


120 


CCTTCATTAT 


TAACATGTCC 


CAGGTATCTT 


TAAGNTTGAT 


GAAACATTAG 


ATACCAGTTG 


180 


TTGGCGAAGG 


CTTAAAGCTT 


TGCC ANCTTG 


AAAAATTAAG 


CCGGAATTAA 


GATGCATTGC 


240 


GTCGCCTATT 


TGAATCCAAA 


CCGATAGTGT 


TAAATATGAT 


AAATTAAAAG 


CACGTTATGA 


300 


GCGTTTCCAA 


AATCAATCAT 


TCAAAAACTT 


AGATTATGAT 


TTCGAAAGCG 


TGCGTACTTC 


360 


AAGACAATCA 


CCATTCGCGC 


AACGTATTGA 


ACAACAACAA 


AAACGTTTGA 


ACTTACCGGA 


420 


TTTACCAACA 


ACAACTATTG 


GATCATTCCC 


ACAAAGCCGA 


GAAGTTCGAA 


AATAC CGTGC 


480 


AGATTGGAAG 


AACAAACGCA 


TTACAGACGA 


AGCATATGAA 


ACATTCTTAA AAAATGAAAT 


540 


TGCTCGATGG 


ATTAAAATTC 


AAGAAGACAT 


CGGC TTAGAT 


GTATTAGTTC 


ACGGTGAATT 


600 


TGAACGTAAT 


GACATGGTTG 


AATTCTTCGG 


AGAAAAATTA 


CAAGGTTTCT 


TAGTAACTAA 


660 


ATTCGGTTGG 


GTGCAATCAT 


ATGGTTCACG 


CGCCGTAAAA 


CCACCAATCA 


TTTATGGTGA 


720 


TGTAAAATGG 


ACAGCGCCTT 


TAACTGTTGA 


TGAAACAGTT 


TATGCACAAA 


GCTTAACAGA 


780 


TAAACCAGTT 


AAAGGTATGT 


TAACTGGACC 


TGTAACAATT 


CTAAACTGGT 


CATTTGAACG 


840 


TGTTGATTTA 


CCACGTAAAG 


TCGCTCAAGA 


TCAAATTGCT 


TTAGCAATCA 


ACGAAGAAGT 


900 


ATTAGCACTT 


GAAGCTGCAG 


GAATCAAAGT 


TATCCAAGTT 


GACGAAC CTG 


CATTACGTGA 


960 


AGGCTTACCA 


TTACGCTCTG 


AATATCACGA 


ACAATATCTT 


AAAGATGCTG 


GTTTTATCAT 


1020 


TTAANCTTGC 


AACGTCTTCA 


AGTTCGGTGA 


TGAANCTCAA 


ATCCATACAC 


ATATTGTGTT 


1080 


ATTCTCAANT 


CGGGCAAATC 


AATCAATGCT 


ATTCAAGATT 


TAGATTGCTT 


GATGTTATTC 


1140 


AATTGAACAC 


TCCGNGCCNT 


GGGGATTTAA 


TTAAGNTTGG 


AGGTTTTAT 




1189 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(2) INFORMATION FOR SEQ 



ID NO: 196: 



40 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 



GGAGCATAAT AAAGAGCGTA TCTTAAACAT TAATTTTAAG TGTGATCCCA AAGTAATTTT 



60 



55 



CGTGATTCAT TGAAAATTGT TCCGAAAGNA CCGTTGTTCA TCCAAGACAA TATTTACCTT 



120 



AACAATATTG ATTTGGGGTT ACCTGGAACG ATGTTATTAA GTTTGGTTAG CTGACAAAAA 



180 
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TAGTCGTTAC CGCCATCGCA GGACCCAATG 
ATGATGCATC CAAAAAATGG AATATATAGA 
CATCCTTGTG ATAATCACAT TGTTTATCTT 
ACCATATATC GATTTAATGA ACGTTTC CGC 
AGTTAACCCT GCAGGCAGTA TTAC TTTAAT 
AAGTGGCATT CATTTTATTT TATCTATGTT 
GCTGACATTT GATAGTCCAG TAGGTATTTC 

10 

TTATTTTTTA TCGAGATTTT TAATCAATAC 
TGGCAATTAT TTTTCAGGAG TTAAACCTGG 
AGCAAGACGC GTATGTTGGT TTGGATCGGC 
is TTATTTCACA TTGTTTGTAC CGCATTTATC 

TGTATTAGTT TACATCAGTA TTAATATTGC 
TAAATATAAG CCATTTTTAA ACCAGTATTG 
CAGCTTGGTA CGATGACCAA CGATGGTGGC 
AAAATAAGAC GGAATTTGAC GATATGATTA 

20 

TAGATTATCA ACTGATTGTT CTCAATCATG 
ATGACT TATA TGAAACAAAG TATTCGTCTG 
ATGCGCCACA AGCGATTAAT TATCATCACT 
TACACGCCCG NATTTATTAA AAATGTGTGA 

25 

TNAGNCAAGA AGGGTATTCA NTTGGGTTTG 
CGGCGTTATA TTTTTGATGA CAGAGGTTAT 
GGAGAGGCTT CTTACCAACA ATATTTAACG 
GATTGGANAA ATGGCAGGAG TCACTGTATC 

30 

ACAGAATATA ACAATATGGC TCAACTAATT 
ACAAATACAT GAAGATGATC ATGTGATTGT 
AGCCAATCAT ATTCCAGCGA AATTGTTAAG 
TGTGTCAGAT GAGGAATATC AATCTATCGT 

35 

GCAACTAGAA CGTGATTTAA TTAGTCATCA 
TCGAATCACA CCATTTGAAA CGAGACAATC 
ATTTATAGGT GTATGGATAG ATGGTATGAG 
AC TTGTGGAT TATATCGCAC AGGAAGATTA 

40 

AAATGACATA CCGATGTGGC TTCGTGAATG 
TAAACAGAAT GCGGATGTTA ATGTTTCAGC 
CATTGCTGTT AAGACGATAC ATGCTGAACA 
AC TTGTGAT A GATATGTCAA AAGAACCTGA 

45 

TGGGATTCCA CAAATTAATG GTCAACAAAC 
TATTATAAAT AC AGC TGG AT GAATTAGATG 
AAAATTGGAA TTATGCGTAC GCCTATTCTT 
ATATTATTAA TCAGCTCGAT GAGTTAATAG 

50 

GTTTTGCAAA TTGGAGGAGA CGATTTAGAA 
TGGGATTACT TCGATATTGG ATTGTTTGAA 
GCGATTGTTG AAGCAGAAGG GCGCTTTGAT 
55 ACATTAACGA ATTTATTACA AATGATAAGC 

TTTTGGTCAG TTGAATATGA ACAAGACGAA 



CCGATTGTAA 


TGGTTAGCAT 


TATTAAATCA 


240 


TGCAAGTCAT 


ATTGTGATTG 


CGTTGTTAAT 


300 


ATTATTCATT 


GAATTAGTAG 


AAGTGAGAAT 


360 


AACAAATATG 


AAATC T T AT T 


TATCTTGGAA 


420 


GATGAGTATT 


TCAGCATTTG 


TTTTCTTGAA 


480 


TAATAAAAGC 


ATATCAGATG 


ACATGCCAAT 


540 


AGTATATTTA 


GTGATTCAAA 


TGTTATTGGG 


€00 


TAAACAAAAA 


TCCAAAGATT 


TCTTAAAGAG 


660 


TAAGGATACA 


GAACGTTATT 


TAAATTATCA 


720 


ATTAGTTACA 


GTCATTATTG 


GTATACCGCT 


780 


TACTGAAATT 


TATTTCTCAG 


TACAACTGAT 


840 


AGAAACAATT 


CGTACATATT 


TATATTTTGA 


900 


GTAAGGAGGT 


AATTATGAAA 


TACTTTATTC 


960 


AAGACACGAC 


TGTGCCGTAT 


TATCAACTAC 


1020 


GTTTAATGGG 


AATGCACCTT 


GAAAATGACT 


1080 


CACCAAATTT 


AAGAACATTT 


TTACATCGAT 


1140 


TGTTTGATGA 


AATTCAAGGA 


TTCAGTCACC 


1200 


TAAAATGGNC 


GGATGATGTT 


GGAGTNTGGG 


1260 


CGAGTGAACA 


GACC T AT AC A 


NATATTTATT 


1320 


NAGANTGATT 


NGAAAGAGAT 


CAAGTNACAA 


1380 


TTATCAGCGA 


TACGTTATTT 


TGATGATCAG 


1440 


GATTAATGGA 


GATTGTGTAA 


CTTCATGGAA 


1500 


NANAAGGATA 


TTCAACATCA 


CTATCAACAA 


1560 


GGAAGAAAAA 


TTTCAAGCAA 


TGATTGCACA 


1620 


GGCTTCAGAT 


GCTAGGCACA 


ATCGACAAAT 


1680 


TTATTCATTT 


TTTAAAAATA 


GAAATGAANC 


1740 


AAAGAATGCC 


CATTTAATTG 


TTGATAGTGT 


1800 


AGAGAAGTAT 


CAGCGGGAGA 


ATACAATGAT 


1860 


ACCTAATATA 


AGTAGTCAAT 


TGATGGAAAC 


1920 


TGACGCTGAT 


TTGCAACAAA 


TGATGCAACG 


1980 


TTACCGTTTA 


ATTTTATTAT 


CGCGCCATCA 


2040 


TATTACGTCG 


GTAAATGAGG 


AATACCAAGC 


2100 


ATTAATGACA 


CCTGAAGATC 


AAGATGACAT 


2160 


TGATGTTGTA 


GAAGCATTGC 


GGACGTTGCG 


2220 


TTTGTATTTA 


CAAATTAGTG 


CAATTAGCGC 


2280 


AGATTACGTC 


TCTGATTATG 


ACAATGGCCG 


2340 


ATGCGTTAAA 


TTATTATTTA 


TTTTATTTGA 


2400 


TAAAATTAAT 


AGATGCATAT 


GCTTCTAAGA 


2460 


AAGGTGAAAA 


TGATGCC ACG 


TAAATTTAGA 


2520 


CCTATTTTTC 


AACACAAAAA 


AGG TGTGAGT 


2580 


TTTGATAGTG 


GTTATGTAGA 


GGCTATTGAA 


2640 


TTTATCTATA 


TTCAAGCACC 


ATACTCGGAG 


2700 


GAACCATACA 


ATACGTATGT 


TGATGAATCA 


2760 


AATTGTCCAA 


AAATACGTTG 


TTCAACCAAT 


2820 
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5 



10 



15 



20 



TACATTACCG 


GAATATTGGA 


AGGAACGTAA 


TAATNAATTA GAGGCAGGNT AGCTTCTCAA 


2880 


GGACAAGGAT 


TGGAGATAAA 


GTTTCCCCTA 


AGTTAGCACT 


TGTGCATCCG 


AATTTTAAAG 


2940 


GAGATGTCGA 


ATACCCAAGG 


TAATTCAAGA 


GCTCACGTTG 


AGTGGAGAAT 


TTGGAAAAGA 


3000 


ATTTAAACCT 


ATCGCATCTT 


GNCAAAATAA 


TCTCGTTTAC 


GATAAAGATA 


AAGTCATTCA 


3060 


NATATGGCCA 


GAATTTGATA 


TTGATGGTGC 


GGTCGAGTTG 


CAATATACAT 


TTAGATTGAT 


3120 


TCAGACTGGC 


GCTGATGGTG 


CATTAATTGA 


ACAAATCATA 


TTAACTGATG 


ATATGTTAGA 


3180 


CAGTCCTTTA 


GA^ATACPTG 
onun j. \, l vj 


CGAAACCATT 


TGATGCTTAT 


ATAAGTGTAA 


CTGTTAAGGC 


32 40 


GCGTGGGAAC 


GGGACGGTAC 


ATTTAGGACC 


TAAACACACA 


CGATGGTCCA 


GANTAGANAN 


3300 


GGNTC AATTT 


TTACNTGGTG 


GGAATCGTTT 


CGAAGATANN 


CNCCGNCNGG 


AATTTAATTA 


3360 


TTAATCCCNC 


CCTGGTGATA 


TGANACCCCC 


ACT AAACG TA 


AATTTTAGTG 


GTTATCGANC 


3420 


ACCGGAAGGT 


TTC GAAGG AT 


ATTATATGAT 


GANACGTATG 


AATGCNCCGT 


TTTTACTTAT 


3480 


CGCCGATCCT 


CGTGTTGANG 


GTGGTAGCTT 


TTATATCGGT 


TCATCTGAAT 


ATGAACAACG 


3540 


TGATTATCAA 


TGTTATTGAC 


GAGACAATAT 


TCATGCTGTA 


AATCCAACAT 


CAGGTGCAGC 


3600 


TGGTAAGTAT 


CAATTCTTAC 


AATCAACTTG 


GGATTCAGTA 


GC AC CTGCT A 


AATATNAAGG 


3660 


TGTATCACCA 


GCAAATGCTC 


CTGGAAGTGT 


TCAAGATGCC 


GCAGCAGTAA 


AATTATATAA 


3720 


CACTGGTGGC 


GCTGGACATT 


GGGTTACTGC 


ATAAGCCATT 


TATGCATAGC 


TAATCAATAG 


3780 


TTATATAAGT 


AACTTTTAGA 


TCGGAATATA 


TCGGG 






3815 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



40 



45 



50 



55 



GGCNCGAGAC 


GCTTAATTAT 


GGAACNCCAA 


GATGCNATTG 


GACNCGGTTA 


TAACATTGGN 


60 


ACAGGCACTT 


TTACT AATTT 


ATTAGAGGTT 


TATCGTATTA 


TTGGTGAATT 


AAATATGGNA 


120 


AATCCAATCG 


AGCNTTGAAT 


TTAAAGGAAG 


CACGAAAAGG 


AGATATTAAC 


CATTCTTATT 


180 


GNAGATATCT 


CT AAC TT AAA 


GGCATTAGGA 


TTTTGTGCTC 


CTAAAAAATA 


CAGTAGAAAC 


240 


AGGTTTAAAG 


GATTACTTTA 


ATTTTGAGGT 


AGATAATATT 


GAAGAAGTTA 


CAGCTAAAGA 


300 


AGTGGAAATG 


TCGTGAAAAT 


GACATTGAAG 


C TGTCC AT AA 


TAATAAGGGT 


TATGCCTATC 


360 


AAAGAAAATT 


AGATGCACTC 


GAAGAAGTGA 


GAAAAGGCTA 


TTACCCAATT 


AAAC GTGCG A 


420 


TTGACTTAGT 


ATTAAGTATC 


GTTTTATTAT 


TTTTAACATT 


TCCGATTATG 


TTCATATTCG 


480 


CCATTGCTAT 


CGTCATAGAT 


TCGCCAGGAA 


ACCCTATTTA 


TAGTCAGGTT 


AGAGTTGGGA 


540 


AGATGGGTAA 


ATTAATTAAA 


ATATACAAAT 


TACGTTCGAT 


GTGTAAAAAC 


GCAGAGAAAA 


600 


ATGGTGCGCA 


ATGGGCTGAT 


AAAGATGATG 


ATCGTATAAC 


AAATGTCGGG 


AAGTTTATTC 


660 


GTAAAACACG 


CATTGATGAA 


TTACCACACC 


TAATTAATGT 


TGTTAAAGGG 


GAAATGAGTT 


720 


TTATTGGACC 


ACGCCCGGAA 


CGTCCGGAAT 


TTGTAGAATT 


ATTTAGTTCA 


GAAGTGATAG 


780 


GTTTCGAGCA 


AAGATGTCTT 


GTTACACCAG 


GGTTAACAGG 


ACTTGCGCAA 


ATTCAAGGTG 


840 
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10 



15 



25 



30 



40 



45 



GATATGACTT 


AACACCGCAA 


CACAAACTGA 


AATATGACAT 


GAAATATATA 


CATAAAGGTA 


900 


GTTTAATGAT 


GGAACTATAT 


ATATCAATTA 


GAACATTGAT 


GGTTGTTATT 


ACAGGGGAAG 


960 


GCTCAAGGTA 


GTCTTAATTT 


ACTTAATAAG 


TTCAAATAAA 


AGTTATATTT 


TAAAGATTGT 


1020 


GACCAATNGT 


TACAGTNTAA 


CGAGGANTCC 


CTTGNGACNG 


TATCAAATGG 


CATTAAAGAA 


1080 


AT A TGTNCC A 


TCNTTTGATT 


TGCNTGGCCA 


ATAAATACTA 


TTCATCTTGA 


TGAGATAAGC 


1140 


CATGTTAAGA 


AATTGAAAGT 


ATAGCATTAA 


NGGGGTTTGT 


AAC AGTTGAA 


AATTATATAT 


1200 


TGTATTACTA 


AAGCAGACAA 


TGGTGGTGCA 


CAAAACACAT 


CTCATTCAAC 


TCGCCAACCA 


1260 


TTTTTGCGTA 


CACCATGATG 


TTTATGTCAT 


TGTAGGCAAT 


CATGGACCAA 


TGATTGAACA 


1320 


NCTAGATGCA 


AGAGTTAATG 


TAATTATTCT 


CGAACATTTA 


GTAGGTCCAA 


TTGACTTTAA 


1380 


ACAAGATATT 


TTAGCTGTCA 


AAGTGTTAGC 


ACAGTTATTC 


TCGAAAATTA 


AGCCTGATGT 


1440 


TATCCATTTA 


CATTCTTCCA 


AAGC TGGAAC 


GGTCGGACGA 


ATTGCGAAGT 


TCATTTCGAA 


1500 


ATCGAAAGAC 


ACACGTGTAG 


TTTTTACTGC 


GCATGGATGG 


GCTTTTACAG 


AGGGTGTTAA 


1560 


ACCAGCTAAA 


AAATTTCTAT 


ATCTAGTTAT 


CGAAAAATTA ATGTCATGTA 


TTACAGATAG 


1620 


CATTATTTGT 


GTTTCAGATT 


TCGATAAACA 


GTTAGCGTTA 


AAATATCGAT 


TTAATCGATT 


1680 


GAAATTAACC 


ACAATACATA 


ATGGTATTGC 


AGATGTTCCC 


GCTGTTAANC 


AAACGCTAAA 


1740 


AAGCCAATCA 


CATAACAATA 


TTGGCGAAGT 


AGTTGGANTG 


TTGCCTAATA 


AACAAGATTT 


1800 


ACAGATTAAT 


GCCCCGACAA 


AGCATCAATT 


TGTTATGATT 


GCAAGATTTG 


CTTATCCAAA 


1860 


ATTGCCNCAA 


AATCTAATCG 


CGGCAATAGA 


GATATTGAAA 


TTACATAACA 


GTAATCATGC 


1920 


GCATTTTACA 


TTTATAGGCG 


ATGGACCTAC 


ATTAAATGAT 


TGTCAGCCAC 


CAAGTTGTAC 


1980 


AAGC TGGGTT 


AAGAAAATGA 


TGTCACATTT 


TTGGGCAATG 


TCATTAATGC 


GAGTCATTTA 


2040 


TTATCACAAT 


ACGATACGTT 


TATTTTAATA 


AGTAAGCATG 


AAGGTTTGCC 


AATTAGCATT 


2100 


ATAGAAGCTA 


TGCCTACAGG 


TTTGCCTGTT 


ATAGCCAGTC 


ATGTTGGCGG 


TATTTCAGAA 


2160 


TTAGTAGCTG 


ATAATGGTAT 


ATGTATGATG 


AACACCCAAC 


CCGAAACTAT 


TGCTAAAGTC 


2220 


CTGGAAAAAT 


ATTTAATAGA 


CAGTGATTAC 


ATCAAAATGA 


GTAATCAATC 


TAGAAAACGT 


2280 


TATTTAGAAT 


GTTTTACTGA 


GGAGAAAATG 


ATTAAAGAAG 


TGGAAGACGT 


TTATAATGGA 


2340 


AAATCAACAC 


AATAGTAAAT 


TACTAACATT 


GTTACTTATC 


GGTTTAGCGG 


TTTTTATTCA 


2400 


GCAATCTTCG 


GTTATTGCCG 


GTGTGAATGT 


TTCTATAGCT 


GACTTTATCA 


CATTACTAAT 


2460 


ATTAGTTTAT 


TTACTGTTTT 


TCGCTAACCA 


TTTATTAAAG 


GCAAATCATT 


TTTTACAGTT 


2520 


TTTCATTATT 


TTGTATACAT 


ATCGTATGAT 


TATTACGCTT 


TGTTTGCTAT 


TTTTTGATGA 


2580 


TTTGATATTT 


ATT AC GG TT A 


AGGAAGTTCT 


TGCATCTACA 


GNTAAATATG 


CATTTGTAGT 


2640 


CATTTATTTC 


TATTTAGGGA 


TGATCATCTT 


TAAGTTAGGT 


AATAGCAAAA 


AAGTGATCGT 


2700 


TACCTCTTAT 


ATTATAAGCA 


GTGTGACTAT 


AGGTCTATTT 


TGTATTATAG 


CTGGTTTGAA 


2760 


CAAGTCCCCT 


TTACTAATGA 


AATCGTTATA 


TTTTGATGAA 


ATACGTTCAA 


AAGGATTAAG 


2820 


GAATGACCCT 


AAC TATTTCG 


CTCGTGCC 








2848 



(2) INFORMATION FOR SEQ ID NO: 198: 



(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 2144 base pairs 

{ B) TYPE: nucleic acid 

{C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 
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20 



30 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

CCCCAAAGAA ATACANTTGT TGGTAGAAGA ACCCCCAACC CACAAGNTGG GG AC CCTTC A 60 

TTTCCAACCN TATGGCGGAG GGTTAAAGTA AATTCTTGAA GC TTTGGTTG CCCNCAAATA 12 0 

ATATGATATT TCCGAAATGG GCCGGAGCCG TATCCAACAA TTTAAAATCT GCCGGAGGTA 180 

TTAAAGCATT ACTTAACCGC TCCGATTAAG ATGNNATTGA AGGAGAGATT AAT ATGG TAG 240 

AATCAATGCT AACTTTTATG CTTGGGCCAT TAAGACAAAT CNCTGATTTT TATATGGAAC 300 

ATTTACTCGT AAGTAATTCC ATCGTCATTG CAGGTTATTT TGCGACAGGT ATTTTTAAAA 3 60 

AGAAAAAAGT TGTGAATTAA ATCANATTTG AGGTGATTTA CAAGTGAAAG CATTGAAATT 420 

ATATGGCGTA GAAGATTTAC GGTATGAGGA TAATGAAAAG CCAGTCATTG AAAGTGCGAA 480 

TGACGTTATT GTTAAAGTAC GAGCGACTGG CATATGTGGT TCAGACACGT CACGAAACAN 54 0 

AAANATGGGG CCATACATTA AAGGTATGCC ATTTGGTCAT GAATTTTCAG GTGTAGTAGA 600 

TGCCATTGGA AGTGATGTTA CGCATGTTAA TGTGGGCGAC AAAGTGACAG GTTGCCCAGC 660 

AATACCTTGT TATCAATGCG AGTAGTGTTT GAAAGGTGAA TATGCACGAT GTGAAAAAGT 720 

TATTCGTCAT TTGNCTCAAT ATGAACCTTG GATCGTTCNG CGGAATATGT CAAATTCCCA 780 

CCCCAAAATG TTTTNAAGGG TTCCAAGCCA ATGTTGATTA CCATTGAANC AACCAATGGT 840 

TTGAGCCATC AGCCCGTTGT TGCGCATGGG TTTTATAAAT C CG AAT AN AC AACCTGGTAT 900 

GACTGTTGCA GTAATGGGGT GTGGCAGTAT AGGTTTGTTA GCTATTCAAT GGGCACGAAT 960 

ATTTGGTGCT GCACATATCA TCGCTATAGA TATAGATGCG CATAAACTAG ATATTGCAAC 1020 

ATCATTGGGC GCACATCAAA CAATCAATTC AAAAGAAGAA AATCTTGAGA AATTCATCGA 1080 

AAATCATTAC GCCAATCAAA TCGATTTAGC TATAGAATCA TCAGGTGCTA AAGTTACGAT 1140 

TGGTCAAATA TTGACGCTAC CTAAAAAAGG TGGCGAGGTG GTATTACTCG GAATACCATA 1200 

TGATGATATT GAGATTGATC GCGTTCATTT TGAAAAAATT CTGCGTAACG AGTTGACAGT 1260 

ATGTGGCTCT TGGAAC TGTT TGTCCAGTAA TTTTCCGGGC AAAGAGTGGA CGGCAACCTT 1320 

ACATTATATG AAGACGAAAG ATATTAATGT AAAGCCTATT ATTTCTCATT TTTTACCGTT 1380 

AAGAAAAAGG C CCGGAGAC A TTT TG AT AAA TTAGTTAATA AGAAAGACCG AT T TGAT AAA 1440 

GTCATGTTTA CGATTTATTA GTATGCACCT TTGAGGACGA AAACGCTGGT ATAGTTATAG 1500 

CTATGAAAGT GCGAATGCCG TCTGGTCTAC AGATACTATC GAAATAATTC ATCTTCGAAT 1560 

ATACGTTGAT AAATAGCCGG TTTACTTGTG TGAAATATGC TTGTGAATCG GTTGTTTTGC 162 0 

ATTTTGTATA CTTAAAATGA GATGGCAATA TTTGATAATT TTTAAAGTGA AAATCAAGTA 1680 

CAGCCACTTA ATAAGATAAA TTTATTATAA TAT ATGG TAA AATGATGGCA GTAATAATGA 1740 

ATTTGAAAAA GAGTAAACAT TAATACCTTT AACAATTTAA TATCGTCAGA GTTAATGATT 1800 

AACTGCATGG CAAAACAACT TAGAATGGTC AGTTACNNAA ATACATTTTT ATAAAAAATT 1860 

ATCACACTAT TGTGACAACT ATCTTTGGAT TAATAAAAGA GGC AAGTGAG CAATAGGTTA 192 0 

GGCTTATGTG CGGGC ATAGG TCAGTAATGT AGAAATGGAA ATGATGTAAT ACGTTAATTC 1980 

GTTTGAATCT TTAAAAGTAT TGGATAAATC AGATAATCGC CTGTTATACG CATCAATCAA 2040 

TGTTTTATAC ATAAATATGT CTGATATTTC TAAGTCGTTA TTTTTGAATG GAATAAATGG 210 0 

AATCTTACCC CACGACCCTG TACTAAAATG CGTTTTTRRD CTNS 2144 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2178 base pairs 
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.(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



CAAATAACGG 


ACCCTTTNGT 


TAATATCCCA 


TNTTTTTGGC 


GGAGTATGGG 


ACCTTAGCAG 


60 


AGAAAATAGC 


CATAAGATAA 


GATTTCGCCC 


CANNCGTTGG 


AGCCCAACTT 


ACATTGTTTG 


120 


TAGAATTTCT 


TTTCGAAATT 


CTTTGTGTTG 


GGCCCCCACC 


CCAACTTGCA 


TCGTCTGNAG 


180 


AAATTGGGGG 


TCCAATTTCA 


CTGTGTTGGG 


NCCCCCTGAA 


CTAAATAGTA 


TGGAAATATA 


240 


TATATCTAAA 


TAATTAAAAC 


GGNCAAATAG 


TTTAATATGG 


CAGTAACAAT 


TTTTATAATT 


300 


TAGATATTTT 


GATATAGACT 


AATGCTTAAA 


TTAAGCAACT 


TTTCTANNNA 


GGGAGTATGA 


360 


AAATGATTAG 


TACTTTGAAT 


GAAATTATGA 


AATGTATCGG 


AAGACCNCGA 


TACAATTATT 


420 


AATACACAAG 


ACATGTTAGA 


CCAGATCCAG 


ATGCATATGG 


TTCACAACTT 


GGTTTAAAAT 


480 


ACTATATTCA 


GCAGAAATTT 


CCGCAAAAGC 


AAGTATTTGC 


AGTGGGTGAA 


GCGGAATCAT 


540 


C ATT AAGTTT 


TATTGGAGAG 


TTGGATAACA 


TTGATGATAA 


AACATATCAA 


GATGCGCTTG 


600 


TAATTGTATG 


TGATAC TGCC 


AATGCTCCAC 


GAATTGACGA 


TGAACGATAT 


AGTACAGGTA 


660 


GTAAACTTAT 


TAAAATAGAT 


CATCATCCTG 


CAGTTGATCA 


GTATGG TG AT 


ATTAATTTAG 


720 


TTAATACGAA 


CGCGTCATCT 


ACAAGTGAAA 


TCATTTATGA 


TTTAATCTCA 


CATTTTAATG 


780 


ATGAAGCAAT 


TGTTAATAAA 


GGCACGAGCG 


AGTGTTTT AT 


ACCTTGGTAT 


CGTCGGTGAT 


840 


ACTGGGCGAT 


TCCTTTTTAA 


CAATACCTCA 


GAACATACTA 


TGGAAATTGC 


TGGAAAGTTA 


900 


ATTGGGCATG 


ATATTGATCA 


TAATGCGC T A 


TTAAATAAAA 


TGATGGAGAA 


GGACCCAAAA 


960 


ATGTTGCCGT 


TTCAAGGTTA 


TGTTTTACAA 


CATTTCGAAC 


TTATGGATGA 


TGGATTCTGC 


1020 


CAAGTTAAAA 


TAACTGAAGA 


TGTATTGGAG 


CAATTCGGTA 


TTCAGCCAAA 


TGAAGCATCT 


1080 


CAGTTTGTTA 


ATACAATTGC 


TGACATCAAA 


GGTTTGAAAA 


TATGGGTGTT 


TCCAAGTTGT 


1140 


TGAGGGTAGT 


GAAATAAGAT 


GTCGCTTACG 


CTCTAAAGGG 


CAATTGATTA 


TTAATGATAT 


1200 


TGCGC AAGAT 


TTTGGTGGCG 


GTGCCCATCC 


GGAATGCGTC 


AGGAGTTTCA 


GTGAACAGCT 


1260 


GGGATGAATT 


TGAGCAACTT 


GCTACAGCTT 


TACGCACAAA 


ACTTAACTAA 


TAGAAAGGAG 


1320 


CCATTCAATC 


ATGGTGGCAT 


ATTTAAATAT 


TCATACGGCT 


TATGATTTGT 


TAAATTCAAG 


1380 


CTTAAAAATA 


GAAGATGCCG 


TAAGACTTGC 


TGTGTCTGAA 


AATGTTGATG 


CACTTGCCAT 


1440 


AACTGACACC 


AATGTATTGT 


ATGGTTTTCC 


TAAATTTTAT 


GATGCATGTA 


TAGCAAATAA 


1500 


CATTAAACCG 


ATTTTTGGTA 


TGACAATATA 


TGTGACAAAT 


GGATTAAATA 


CAGTCGAAAC 


1560 


AGTTGTTCTA 


GCTAAAAATA 


ATGATGGATT 


AAAAGATTTG 


TATCAACTAT 


CATCGGAAAT 


1620 


AAAAATGAAA 


ACAATGGAAA 


ATGTTTCATT 


TGAACTATTG 


CAACAATTTT 


CATCGAATTT 


1680 


GATTATCATT 


TTTAAAAATG 


TTGCAGACGA 


ACATCGTGAC 


ATTGTTCAAG 


TTTGTGATTC 


1740 


GCATGAAGAT 


ACGTATTTAG 


ATCATCAAAG 


TGTTTTAGTT 


CAGGGTATAA 


AGCACGTATG 


1800 


GATTCAAAAT 


GTTTGTTACC 


AAACACGTCA 


AGATGCCGAT 


ACGATTTCTG 


CATTAGCAGC 


1860 


TATTAGAGAC 


AATGCAAAAT 


TAGACTTAAT 


TCATGATCAA 


GAAGATTTTG 


GTGCACATTT 


1920 


TTTAACTGAA 


AAGGAAATTA 


AACAATTAGA 


TATTAACCAA 


GAATATTTAA 


CGCAGGTTGA 


1980 


TGTTATAGCT 


CAAAAGTGTA 


ATGCAGAATT 


AAAATATCAT 


CAATCTCTAC 


TTCCTC AATA 


2040 


TCAGACAACT 


AACGATGAAT 


CAGCTAAAAA 


ATATTTGGTG 


GCGTGTCTTA 


GTTACACAAA 


2100 


TTGAAAAAAT 


TAGAACTTAA 


TTATGACGTC 


TATTTAGAGC 


GATGGAATAT 


GAGTATTAAG 


2160 
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TTATTACTAA TATGGGTT ... 2178 

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1528 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 



30 



35 



45 



50 



TCATTTTTTT 


GATTGAGCCA 


AAAGGAGTCC 


CCCCTGTGAG 


CGGAATATTT 


AATTTATTGA 


60 


GCTATTTAAT 


ATTAAACGTA 


CGCTTAACCC 


CCCTANAGTG 


ATATCGTTTC 


TAGCGTTGTC 


120 


ATTATCATTA 


AGCGAAACAT 


TTTAAAGACA 


AATACACACT 


GTACGATCAC 


CAAACTGCAT 


180 


GTCGAACAAT 


GTAACATTTG 


GATTCGATAT 


TTAAAATTGC 


TTGTGATGAT 


AAACTTTCTC 


240 


ATTTAGAAAA 


CGCTTCCACG 


TACATTCAAA 


AAAATAACTT 


TGTTAAC C AT 


ATTGTAACAT 


300 


TATTTCATAT 


ATTTTGGGGC 


ATGAGAATGA 


TTCTCACGCC 


CAGTAATTTA 


TTTATGCAAT 


360 


TGTTCATGTA 


GGTTCTTTGC 


GACX5TTTTCA 


GGAATACCTA 


TATTTTTAAA 


ATCTTCAAGT 


420 


GTAGCTCGTG 


CCTTCATTTT 


CTTGATTGAA 


CCGAATGAAC 


GCAATAATAA 


TGTTTTAACG 


480 


TTTGTTACCG 


ATACCATCTA 


TATCATCAAG 


TATTGATTTC 


AAGCCTGTCT 


TTTGACGTGT 


540 


TTGTCTATGA 


AATGTGATTG 


CGAATCTGTG 


AACCTCATCT 


TGGATACGGA 


TGCAACAAAT 


600 


AAAATGCCTG 


CCTATTTTTC 


TTCAGTGGAC 


AATTTCCTGC 


ACTAGCGCCA 


TAATAATAAT 


660 


TCAGATGTTT 


GGTGTTTATC 


ATTTTTCTGC 


AAACCTGCAA 


CAGGGATATC 


AAGACCTAAT 


720 


TCGTTTTGTA 


GCACATCAAT 


AACCCCGTTC 


ATATGTCCTT 


TACCACCATC 


GACTATTATT 


780 


AAATCAGGTA 


ATGGTAATCC 


TTCGTTTAAA 


ACGCGAGAAT 


ATCGTCGTCT 


TACTACTTCT 


840 


CTCATTGATT 


TGTAATCATC 


TGGACCTTTA 


ACCGTTTTGA 


TTTTATACTT 


TCTATAATTT 


900 


TTCTTATCTG 


GTTTACCGTC 


GACAAATGTA 


ACCATTGCTG 


ACACTGGATC 


CACACCTTGA 


960 


ATATTAGAAT 


TATCGAATGC 


TTCAATTCTA 


ATTGGTGTTT 


GAATTCCCAT 


TTGTGTTCCA 


1020 


AGTTCTTCAA 


TAGCTTTAAT 


CGTTCTTGAC 


TCATCACGTG 


ATATTAATTC 


AAATTTATTA 


1080 


TTTAAGGATA 


CTTTAGCGTT 


ATGTGCAGCT 


AGGTCAACCA 


TATCTTTTTT 


GGGAC C TCGC 


1140 


GCGGGTTGAA 


CGATTTTAGT 


GTCCACAACA 


GATTGAATCA 


TTTCTTTATC 


CAAATTACGT 


1200 


GGTACATGAA 


CTTCCTTAGG 


TAAAATATGT 


TGGTTTAAGC 


TATAAAATTG 


TCCAATAAAT 


1260 


GTATAAAATT 


CTTCTTCTTC 


TGTTTGGTGT 


AATGGAATCA 


TCGGNGTATC 


TCGCTTTATC 


1320 


ATATTACCTT 


GTCGGATTAA 


AGAAAACTTG 


GATACACATC 


CATCCTTTAT 


CAACACAATA 


1380 


AC CAAAGAC A 


TCACGAATCG 


GNTTATCTGA 


TGACATAATT 


TNGGGTGGGT 


GGGCAAGATT 


1440 


TGGATATTGT 


TGGATTAAAT 


CTCTATATTC 


TTNAGCCGCN 


CAAAATCAAG 


TGATTC AC TT 


1500 


GGAGGNCACA 


TNCGCTCTTC 


TAANCTTT 








1528 



(2) INFORMATION FOR SEQ ID NO: 201: 
(i) SEQUENCE CHARACTERISTICS: 
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.(A) LENGTH: 2933 base pairs 
(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 

AGGTTGATAC ATATGTC TTG TAAATACATG TGTCATTGCT TGACCAATTG CAGCAATCTG 60 

is GACCCAGGAT NGGTGCCTAC CCAGTTATCT CCAAATTTCA CATTTCTGCG TAATCTGGTG 120 

CTACTTGAAA TGACTTTTGT ACCTTAATAG CGGACTTC TG TCATAAAATG TGCATCCGGA 180 

GTACGTGTTA AAGGTACATT AGAGCCCCAC ATAATAATGT ATGATGCGTT ATACCAGTCA 240 

CTTGATTCAG GCACATCTGT TTGCTCTCCC CAAATTTGTG GAGAGGCAGG TGGTAAATCT 300 

GCATACCAGT CATAAAAACT AAGCATTTCA CCACCAAGCA AATTGATGAA TCGAGCACCT 3 60 

20 

GCTGCATAAC TAATCATTGA CATCGCTGGA ATAGGTGTAA ATCCTGCGAT TCGATCTGGA 420 

CCATATTTTT TTATTGTATA CAGTAATTGT GCTGCGATTA TCTCTGTAAC GTCTTTCCAA 480 

TTTGAACGCA CGTGCCCTCC CATACCTCGG GCTTGCTTAT ATTGTTTGGC TTTGTCTTCA 540 

TTTTCAACAA TAGACGCCCA TGCAGCAACG CGATTACCAT AGTTTTCTTC TAATGCTTCA 600 

25 

GTC CAT AAAT CCCAGAGTTT TCCACGAATA TATGGATATT TGATTCGAAG CGG AC TG T AT 660 

TCATACCAAG AGAATGACGC ACCTCGTGGA CATCCTCTCG GTTCATATTC AGGCATATCC 720 

GGACCACAAC TTGGATAGTC AGTTTGTTGA TTTTCCCAGG TAATCACACC ATTTTTCACA 7 80 

AATACTTTCC AAGAACATGA GCC TGTAC AG TTAACACCAT GTGTTGTTCT TACTTCTTTA 840 

30 

TCGTGGCTCC AACGTTCTCT GTACATTTTT TCCCATTCTC TACTTTTACT TTCTAGGATC 900 

GACCAATTCC CATTAAATTT TCTGTTGGCT TAAAGAATTC AATCCAAATT TTCCCATATT 960 

TATATCCTCC TACGTATAAA AAATACGATG TGTAGATGTC GTGTTNTTAA AT AC TTT AAA 1020 

ATGCCCAAGA CTATTGCTTT AATTAGATTG TACATTTTTT CACAAATATA AAAT ATT AGG 1080 

35 

GAATCACCTA ATTACTTAAG GAATTTCCCT ATCAATAACG GGATTTCATT GAAATAATAC 1140 

ACAATCATGT ATGGTCGTGC TTATTGCCAA TCTAAATCGT TCAAATTTGC CACAACGACA 12 00 

AATAAGGCTT CAACACGAAT ATATTCTCTC GGTTGAAACC TT AC TTATTC ATTTATTTTT 12 60 

TATAAATTAG TGACATAACA CTGTATTAGC ATCTGCACGA TCGGTTGAAA TATATGTTAC 1320 

40 

ATTTTCTTGC TGCTTAATAA ATGCATCATA G TAATC AT AT TGCCGACCGG AATGATATGT 1380 

CCCATTCGAT GTATCATTTG GGTTTAGCAA ACAGCCATAA CCTTCGTCAT ATAAATGTTC 1440 

ACAGAGCATA AGGGCGTCAT GTCTAGAACC ACTTACTACA TAAAATTGCT GAGTCGGATT 1500 

AGAAGTTGTT GAGACATTTT C AGTATAAC C CACTACTTCG CCTATAATAC ATATACCTGG 1560 

45 

TTTCGCCTCA ATTGCGTAAT GCTCCAGTTT CCCAACAATA TTACTTAAAC GCCCCTTAAC 1620 

AACAAACTCG TTAAAACACG ATGCTTGAAA GACAATCGCT ATCGGATAAT CAATATCTGT 1680 

GTACTGTTGT ATCTGGGCGA TAATTTTCCC TAAACGTTTA ACTCCCATAT ATATCGCCAA 1740 

CGTACCACCA TTTACTAAAG NGTTGACATC TACGTCATTT TCTTCTGAAT CTNTNAAGTG 1800 

SO 

ACCTGTAGAA AATGTCNCAC TCTTAGCAAC AGCACGCNTC GNTAATC CTG TTTGCATCGT 1860 

AGCAGCTGCA GCACTCGCAA GATGTCACAC CAGGTACAAT GTCANACGCA ATAGTAAAAG 1920 

TTGTTTAGTG TGTCAACTTC TTCTTGAACT CGNCCAAATA TTGCTGGGTC GCCACCTTTA 1980 

55 ACCCTCACCA ACCTTGTTAT ATCGACGTGC TGCTTC C AC G ATACATTCAT TTATTTTTTC 2040 

TTGCTGAATG TGTTTTGCAT ACGGC TTTTT ACCAACATCG ATAATTTCAG TAGTCAAATT 2100 
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CGCATATTGT 


AAAATTAACG 


GATTCACTAA 


TCGATCATAT 


AGTATAACAT 


CCGCTTCAQG 


2160 


TATTAAACGC 


TCAGCCTTTT 


TAGTCAAATA 


ATTCGGATTA 


CCTGGTCCCG 


CACCTATCAA 


2220 


G T AAACC TTG 


CCATATTTAT 


C TACAGAC AT 


ATATATACGT 


TCCCGTCTGT 


AACTTCTACC 


2280 


TCATAAACAT 


CTACACAACC 


TTCATCAGGT 


TCTTGTACAA 


TCCCAGTATT 


TAAATCAATC 


2340 


TTTTG ATC AT 


GAAGTGGACA 


GAATACATAT 


TCACCACTCA 


CTGTTCCCTC 


TGACAATGGT 


2400 


CCTTGTTTAT 


GCGGACAAAT 


ATTGTGGAAT 


CGCGTGANTT 


GTACCACTTT 


CAGATAAAAA 


2460 


CCAATCCTAC 


CTCTTTGCCT 


TTGNCAATAA 


CCTTTTTTCC 


AATTAGGGGT 


GTTAATTCAT 


2520 


CTATAGTTGT 


CACTTTAATT 


TTTTC TTTTG 


TTTCCATGTA 


TTACACCTTC 


TCCACTTCAA 


2580 


AAATTTTACG 


TGCTTGCGCA 


TTGCTAGNTA 


TTGCTTCCCA 


AGGTTCAGCT 


TCGACTGCTT 


2640 


TTTTAGCATC 


CATAATGCGT 


TCAAATAGTT 


CATTTTGTCT 


TTCTGGGTCA 


AGTAAGACTT 


2700 


CTTTTACATT 


TTCAAATCCA 


AGTCTTCTTA 


ACCATGGCGC 


TGTTCTTTCA 


GCATATATAC 


2760 


CTGTTTCGCG 


ATATAGTGCA 


TCAAAGCACC 


ACATAATGTG 


NTTACTTCAT 


CTTCTGTTTC 


2820 


TACAGTTGTT 


AAAAATTCAG 


CTTTTTCGAC 


TTCTGTACCA 


CCATTACCAC 


CGATATAGAT 


2880 


TTGGAATCCA 


TTTTCAACTG 


AGATAATACC 


AAAATC TTT A 


ACACCTGATT 


CAG 


2933 



(2) INFORMATION FOR SEQ ID NO: 2 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02: 
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TAAAGTTCNT 


TCCCCTGCTN 


AATGTCCAAN 


TTTCCCCCAT 


AGCAACTTTC 


AAATATTTTT 


60 


CATGACTTGC 


TTTAGCCCAA 


TCAAGTTCTT 


TACCTGAAGG 


TATATTAAAT 


TGATTTGTTG 


120 


AAAAGTTCCA 


AAAATTCTGC 


GCTTGGGTAA 


GTCCTTGTGG 


GACAATTTTT 


TGAAATCCTC 


180 


CAACTTCTTA 


AATATTTCTG 


GTGATTTTTG 


ATTAAAACTC 


ACGTAATTTA 


CGTAGTTTCT 


240 


CTTCTAATTC 


ATGTTTTTTG 


TTGACCTAAT 


GTTTGTATTA 


TTTGTTGGTT 


CGATGAAATG 


300 


GCTTGCTGAT 


TATCGGAAGC 


ATGCTTTTTC 


AAATTG TT AG 


TTAAATTTTC 


ATATCGCGTA 


360 


ATTTGTTGAC 


TTAATGATCT 


GATATCTTCT 


TCAAGCTCTG 


ATTCTTTTAA 


AGATATGCTA 


420 


TCAACCTCAC 


TCGTATAACG 


TGACACAAAA 


TTATCGCAAG 


CTTGCTTCGT 


TAAATCACTC 


480 


AATGTTTTCA 


TACTTGTTGA 


TAATGGAATT 


AACACCGTAC 


TAAAAAATTG 


C TTAGCTG AC 


540 


GTATACGCTT 


TCCCTTTAAG 


CGCATCATCA 


TTGATAAATT 


GAGTAATTGC 


TTTTTCCAAC 


600 


GCATCATAAT 


TTGAATTCAT 


TGTTTGACTC 


AAATTCCCCA 


CACTTGAAGC 


TTGGTTTCGA 


660 


GATCTGTCTA 


AATACATGTC 


AATACTCATC 


GGCATGCTCC 


TTTTTCAAAA 


ATATATGATT 


720 


TTCAAACTAT 


TTAAAATCAA 


ATGCTTTTTA 


CATCTACAAA 


GTTGTAAAAT 


TTTAAAACTC 


780 


GGCGATGATT 


ATTTCTTATG 


TAAAGGAGTC 


TAGATGCAGG 


TAAATTGAGA 


TAACATGTCG 


840 


CCTTTTTTCT 


TATTTTAGCA 


TATGGATATA 


ATGGTGTCTT 


TGTATATTCG 


CAATTAATCA 


900 


ATAAAAATTA 


TCTTTCAATA 


TTTTAATTTT 


ATTGCGACAA 


CATCCTTAAC 


ATTAAATATA 


960 


TTAATATCTC 


AAAATATATT 


CACTATTAAA 


ATATGTCATC 


AGTTGTTAAA 


AGTATTTCCT 


1020 
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CATCATGCGA 


AATATCAAAA 


CGTATCTAAA 


ATACGAATAG 


TTTATCAATC 


ACACAACATC 


1080 


ATCATCCAAA 


ATTTNATTGC 


TCTCATNTNC 


AATATTCATT 


TCATATCTAT 


CAGTTCATAC 


1140 


ATAATTAAAA 


TTTCAATTGT 


GCAATCTCAC 


CGTTAATGCA 


TCAACTTTCA 


AACAATAAAT 


1200 


CATCACAATA 


ACCACACCTA 


ATTCAACACT 


TTTCAAACAT 


AAGTATTGAC 


ACATTGAGCA 


1260 


AAATGATTTT 


TAATTGTAAC 


TAATACAGTT 


ACAATTATGA 


GGTGAGAAAC 


ATTGAATTTA 


1320 


GAATTTAACA 


TTGCCGTGCA 


TGTATTAGCT 


TTTTTAACTA 


AGCATCATTC 


AG AAAAA TTC 


1380 


AATAGTAGTT 


CATTAGCAGA 


ATTAACTTGT 


TTANATCCTG 


TTCAATTACG 


ACGCGTGACG 


1440 


ACTCAACTTG 


TCGATTTAAN 


AATGATTGAC 


ACAATACGAG 


GTAAAGATGG 


CGGTT ATTT A 


1500 


GCAAATGATC 


AAAGTGCTGA 


TGTCTCTCTA 


GCAACATTAT 


ATAAACATTT 


TGTC TT AG AG 


1560 


AAAGAACAAC 


ACACACGTCT 


ATT T AC TTGG 


CGACGAAGGC 


AGTCACTGTC 


AAATTGCTCG 


1620 


TAATATTGCA 


ACTACCATGT 


CACATTATCA 


GCAAGACGAA 


CAGAATATCA 


TTATTAATTT 


1680 


TTATAATGAA 


AAAACAATCA 


AAGATGTCAT 


TGAAGACATT 


CAAAAGGAGG 


ATTTATGTCA 


1740 


TGAAAACATA 


TGATTTAATT 


GTAATAGGAT 


TTGGGNAAGC 


TGGTAAAACT 


TTAGCGAAAT 


1800 


ATGCTGCATC 


AACAGGTCAA 


CACGTTCGCA 


GGTATCGAAC 


AATCTTCCAA 


AATGTATGGN 


1860 


GGCACT 












1866 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
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ACAATGCCTC 


CAAGAGGTAC 


ANCCGATAAC 


TGTCTCACAA 


AACTAAATAA 


ATTTAACTCT 


60 


GTGTCCTCTT 


CCCATCCGTT 


CACTTTAATA 


TCTTCTCCAT 


AAGCATCAAC 


GGAAAGATAA 


120 


AANCCGGACC 


TGGAAATGTA 


TGTGCCATCT 


CTTTAACCCA 


ATCAGTATCT 


TGAATGCCTT 


180 


TCGNTCCCAA 


CCTATGCAAT 


AATTAATCCC 


TGCGGCAAAG 


TAGTCCATGA 


TTTGGTGACT 


240 


TCGTACGAAT 


GCCACCACCT 


ACTTCAATAT 


CTTTGGTTGT 


TAATCTCCTC 


AATGACTTAA 


300 


TATAATCAAA 


CTCTCGGGCA 


TACTGTGCCT 


TAGCACCTAT 


CAAGTCGACG 


ATATGAATAC 


360 


GATTCACACA 


TTCAAATTGA 


CTATAGTAAG 


CAATACTTCC 


TTCAGCCGAA 


CGCGACATTT 


420 


TTTCTTCACT 


ATCATATTTA 


CCCTCTGTTA 


ACCTCACACT 


TGTTGACCCA 


ATCAAATCAA 


480 


TCGCTGGCCA 


TAATTCAATC 


ATTT AT AAA T 


CCCCCTTGTA 


TTGCCTGACG 


CAAAATTTGT 


540 


AACCCATATG 


TACCGCTTTT 


TTCAGGATGG 


AATTGAATAC 


CAATATAATT 


GTTAAATTGA 


600 


ACAATTGCCG 


GAATATCAGC 


CCCATACTGC 


GCATATGCAA 


TGACATTTTC 


TGACATCGGC 


660 


GCTTGGTAAG 


AATGTACGAA 


GTAAACATCT 


TGATTTAACA 


TAGGATGCTT 


AC TC ACT AAA 


720 


TTATTCCAAC 


CTAAGTGCGG 


CACTGGGTAT 


TCTG TTGGGA 


TACGCGAAAT 


ATTTCCTGGG 


780 


ATAAACCCTA 


ATCCAGATGC 


ATCGCCTTCA 


TCACTATGCT 


CATACATTAA 


TTGC ATGCCT 


840 


AAACAAATAC 


CAATCATCTT 


CTTATCAGTA 


TTCTTAGCCA 


ATATTGCATT 


GAGATTTAAT 


900 


CGTTTTATCT 


CTGACATCGC 


ATCTTTAAAA 


TGGCCGACAC 


CGGGC AATAT 


GATTGTTTCT 


960 
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GCTTGATCGA TTATTTTTGA GGTATTTGAG ACAACCACCT CATACCCTAA ATGTTCAATA 102 0 

GCGCGTTTTA CATTACTAAT ATTCCCTAAT CCATAATCAA CGATG AC AAT CATTCAATCA 10 80 

CACCTTTCGA TGACGGCACA CGCTGATCAT CAGTTGCAGT TAGCGCTATG CCTAATGCAC 1140 

GGGAAAACGC TTTGAATATA GCTTCAATTT CATGGTGTGT ATTACCTCCA CGAATTAAAT 1200 

CAATATGCGT TGTTAATCTT GCATTGATTA CGACCGCTCT AAAAAATTCT TCTACTAACT 12 60 

CCGTATCAAA CGTACCAACT TTTTC TTT AC TTAATGATGC ATTGAATGAT AGGTATGGGC 1320 

GCCCACTTAT ATCCACAACG ACACGTGCTA ATGTTTCATC CATTGGAATG TACATCGTTC 13 80 

CATAACGAAC GAAATGCTTT TTATCTTTAA TCATTTCAAG TAACAATTGG CCAATGACAA 1440 

TGCCGATATC TTCAGTTACA TGG TGATC AT CTACGTCAAT ATCACCTTGT GCCTCAATGT 1500 

TTAATGACAG ACCGCTATGA AATGTAAACA ATGTTAACAT ATGATTTAAA AAGCCCACAC 1560 

CTGTATTAAT ATGCGATGGT GACTGGTCAT CTGATATTGA AATATTTAGT TGGGGTTTCA 1620 

GCTGTGTTTC GGTGGTTGTG AATAAATCAT ATTGTGCGCT CCANTCTTTA ACAATNTCTT 1680 

CCTAATTGCT TTAACTGCGA TGCTGTTGCA ATTGAGTATC TTACATAGCC CTTCATCAAC 174 0 

CGGCTCAACA TAAAAGCGAG GTTTAAATCC TTGTTCAAAT ACGTATTGTC CTAATTGGTG 1800 

CGCTGCTGAA CCTTTAGTAA GTACAAAATT GGCATTTGAT GGGAATACTG ACATTTTATC 1860 

TGCAACATGT GTATCAAATA TTTGTTTTAA CTGCTCAGCT AACTGTCGTT GCATCGTTAA 1920 

AAATACTCTT GTCTCTTCTC TATGTCTAAA AATATAAGTC GCAATATTTA GCGTAAATAC 1980 

ATTTAATGGA TATGGATGTT CTATTTTTTG AATATGCTTT ATCGTTCCAG CAGTACTAAT 2 040 

TAAGACACCT AATCTTAAGC CGGCAATTCC AAACGCCTTT GATAATGTAC GCATTCTTAA 2100 

GATGTGTGGT GCCAGTTCCA CGTCATATGC CGTACCATAA TCTAAATATG CTTCATCAAT 216 0 

GACAAAGTAT CCGTTTAATG CTTTCATCTT ATCTGCAATA GCTGTTAAAA ATGCCGNATC 222 0 

AAATTGCTTG CCTGAAGGGT TATGTGGATT ACTCATAATA AAAAATGATG GNTGTNCTTC 2280 

ATCGATTTTC GTTAAAATGG TTTCCAAATC AAACGTNAAA TCTGATCCNG CATCTTACAA 2340 

ATGCAATTTC ACGATTTACT TGTGCCGCAT ATGCTTGATA CATAAAAAAA TCAGGATTTA 2400 

GCGTTAATGC CGGACCTTCT GGCATGATCA GCATTAACTT TTGANTCANT TCATCAGATC 2460 

CATTTCCTGC AATAANTTGT TCAGGCGATA ACCCGTAAAA CTTAGCATNA GCTTCCGTGA 2520 

ATTGTTCATA TGCTGCATCA GGATATAAAT TATATGGCGT TGCACTAATA ATAGACGTCA 2 580 

TTGTTTTTTC ATCCAACGGC GTAACTGGAC TTTCATTTTT ATCAATATAA ATCATTTCAA 2 640 

TTACACTCCC CTAAGACTGA CGTATTAAAA TAGACTGCTG GTGATTGTAT AATGCTTCAA 2700 

CATGAGCAAT ATGTTGTGCT GAATCAGCAA TTTGTTCAAA CGTATCTTTT GATAAATGGA 27 60 

TGACCGTGTT CCGTGTTAAG AAATCATTGA CCGATAACCC ATTGGTAAAT CTAGCTGTTC 2 820 

TATTTGTAGG TAATACATGA CTTGGACCTG CAACGTAATC TCCTATGACC TCTGGCGAAT 2880 

AATGTCCAAT AAACAATGCA CCCACATATT TCACTTNCNC AATATATGGN TGAGGATTTA 2940 

CTG TGTGAAT CGAC GCATGT TCAGGCGCGA TCGTGCCTTT CATGACATGG 2990 

(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3386 base pairs 
{ B) TYPE: nucleic acid 
(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 04: 



5 





GCATCGGTTA 


AATGGCGTCA 


TTCCACTTAC 


GGGCATCTTN 


AGGCATTTTG 


CTTTTCAAGT 


60 




GGCTAATTTA 


GCTTCACCGT 


TTAAGGC TAT 


TTTTAGCATC 


AGTCACTTGA 


GTTAATGCAC 


120 




GGAGTAATGT 


CATCTGGGTT 


AAGCGTTGGA 


TTTGTCGTTT 


GATTGATAAT 


ACCTTCCGCT 


180 


10 


TGAGACACTG 


C ATTTG TATA 


TGCTGTTTGC 


TTATCGACAT 


CAGCATCGTG 


GTAGTTCTCA 


240 




CTTGCTTTCA 


CAGTATCTTT 


ATTTGCAATA 


CCTTGTTTCA 


AGTTACCCAT 


TGCTGTATTC 


300 




ACATTTTGTG 


CATTTTTGTT 


TAGTTTGTCC 


AACCTGTAGC 


TAAAGCTTGT 


GCTTGATTGA 


360 




NTTGGTTTCG 


TAATGCATCA 


CGTTGTGGTG 


ATTTAAGTCA 


CGTNACGTAT 


CAAGATTTGC 


420 


15 


TAAAGCATCT 


TGTTTC GCTT 


GCGCTAATTT 


TTCATCACCG 


TTTAATGC AT 


CTTTCGCTTG 


480 




ATTCATAGTT 


GTAACTGCTT 


TTGTTACTGT 


ATCAGGCGCC 


AATACAGGTG 


TTGGTGTGCC 


540 




ATTTGCGATT 


TGTTGCGCTT 


GGTTAGCTGC 


ATTGTTATAA 


GCTTGTTGTT 


TGTCTTGATC 


600 




CGCTTGTGTA 


AAGTCAACTG 


ATTGTGGTAC 


TTGACTATTC 


GCTTGAATTT 


GTTGTGTCAA 


660 


20 


CGTACCCATT 


GCATTATTTA 


ACGCATCAGC 


ATTTTGCTTA 


ATAGCATTAA 


CACCTGTAAC 


720 




AAGTTCTGCA 


TGCGACACTT 


GGGTCTTTTA 


ATGCTGTTTT 


TTGTGGTTGA 


TTTAAGTTTG 


780 




GTAACTGATC 


AATGGCTGTA 


TTTGNATTGT 


CTTTAGCAAC 


TTGTAAGTTA 


TGGTTACCGT 


840 




TTAAATCACC 


CTTAGCTCGA 


GCCACTTGTT 


GTAACGCTTG 


AGCCACTTGT 


TGTGGACCCA 


900 


25 


CGCTTGCATT 


TGGGTGNACC 


ACTAATGATT 


TGGTCCAGCA 


TGCGCTACCG 


CATTTGAAAA 


960 


ATGCATTTTT 


GCTTATCAGT 


ATCAGCATTT 


GCCAAAGTTA 


CCATTTGCTT 


NAGGTGGTCT 


1020 




TAATCTGCAA 


TACCTTGTNT 


CAATTGTGTC 


ATTGCTTTGA 


NNTAAATCTT 


TGTGC TGTTT 


1080 




GCTTAATCGT 


GTTNACATTT 


GCAACAGTTT 


GCGCAGCATC 


GACTTGTTGT 


TTAAAGGCAT 


1140 


30 


CTTTGTGCGC 


TGGATTTAAA 


TCATTTGCAT 


TTGTAATAAC 


TTGTTTCGCT 


GCATCTTTTG 


1200 


CACGTTGTAA 


CATTGTCATT 


ACCATTCAAT 


GCTTGTTTCG 


GCTTCGTTCA 


CACGTTGCAT 


1260 




CGCTTGTTTC 


CAACTTCAGT 


TTGAACTTGC 


ATTGCTGCCA 


TTTGCTTTAG 


ATAAAATACC 


1320 




TTCTGCATGT 


GAAACCGCAT 


TAGTATAATC 


ATCTTTCTTA 


GCTNGATCTG 


CATCAGTAAA 


1380 


35 


GTTAATGCTA 


TCTAATGTGT 


CAGTTTTATC 


TTGTAATGCA 


TTGTTTAAGT 


TAGTCATAGC 


1440 


ATTATTCAGA 


TTTTGAGCAT 


TTTGCTTAAC 


ATTATTAACT 


GTTGCAATAT 


CTGGTGCTTG 


1500 




TTCAACTTGA 


GTTGTTAGCG 


CTTGTTTTTG 


AGCATCATTT 


AAATGATCTA 


GTGTACCAAG 


1560 




TGTTGTCTTA 


GCATCTTGTT 


TGGCATCAGC 


TAATTTGTGA 


TTACCGTTAA 


GTGCTTGATT 


1620 


40 


TGTGTCATTC 


ACTTTAGTTA 


ATGCTTGTTG 


AATTGTATTT 


GGATCCATTG 


TTGGATTTGT 


1680 


TGTTTGTTTT 


AACAATTCTT 


CTGCTTGNGT 


TACTGCATTA 


TCATATGCTG 


TTTGCTTGTC 


1740 




AGTATCAGCA 


TCGTGATAGN 


TCTCGCTACC 


TTTAATTTGT 


GCTTTATTCG 


CAATACCTTG 


1800 




TTTCAATTGT 


GTCATCGCGT 


CATTAAGCGT 


TGTTGCCGCT 


TGTTGAATAG 


CATTGACGTT 


1860 


45 


TGGTACAAGT 


GTTGCTTGCG 


TGATTTGTTT 


GCTGTATTCA 


TCACGTTGCG 


CTTGGTTTAA 


1920 




GTTTGGCAAT 


TGATCAATAG 


CATGTTGAAC 


ATTTTGTTTC 


GCCGTTGCTA 


AGTTTGTATT 


1980 




ACCATTTAAA 


TCATTTTTAG 


CTTGCGTAAC 


TNNATCTAAC 


GCTGCAGGAA 


TTTCGCTAGG 


2040 




TGTAACGACA 


ACATCAGGCG 


T AC C ACT AAT 


TAATGCTTCA 


GCTTTCGCTA 


CTGCTTGATT 


2100 


50 


ATATGCATTT 


TTCTTGTCAG 


AATCTGCATT 


GACAAAGTTA 


CCATCAGCTT 


TTGTTTGTTC 


2160 




TTTATCTGCA 


ATGCCTTGTT 


TTAATTGTGT 


CATCGCATTG 


TNTAACTCTT 


GCGCCGTTTG 


2220 




TTTAACATTG 


TTTACACCAG 


CTACAGTAGT 


TGCATTTCGT 


ACTTGTTGTT 


TTAATGC ATC 


2280 




TTTCTGTGCT 


TGGTTAAGAT 


C ATT AG AG TT 


ATTAATTAAT 


GCTGTTGCTT 


CGTCTTNTGC 


2340 


55 


ATGTTGAACG 


TTGGCATTAC 


CATTTAATGC 


TTGTTNTGCT 


GCATTTACTT 


GTTGGATTGC 


2400 




TTGNNCAACT 


TCAGTTTGTG 


TTGCATTACC 


ACCATTAGCT 


TGTGGAAATG 


AT ATT TNC AG 


2460 
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CATTTGTAAC 


TGCAGTGTTA 


TATGCTTGTT 


GTTGNGCTTG 


ATC TNNATC A GNGAAGTTAC 


2520 


CTGNTGCTNG 


CGTAGCATCT 


ATATCACCTA 


TCGCTTGTCA 


CAAGTTACCC 


ATCGCAGTAT 


2580 


TTAAGTTTTG 


CGCATTTTGC 


TTAATTTGAT 


TTGCTTCATC 


AC C AGTATGC 


GCGCCATTAA 


2640 


TTTGATTAGT 


AACAGCTTGT 


TTTTGCGCAT 


TATTTAAGNT 


GTCTAATGAA 


CCTAAAGATT 


2700 


GCGTTGCTAG 


TTGCTTCGCT 


GCCTCTAAGT 


TTTCATTACC 


ATTTAAAGCA 


TTTTTAGACG 


2760 


TGCTTACTTG 


TCCAGCAGCT 


TGATTGATAA 


CAGTCGGATC 


TAATGAAGGG 


TTTGTAGTTT 


2820 


GATCAATAAT 


ACCTTGTGCA 


GTTGTGACAG 


CATTATTGTA 


CGCATCCTTT 


TTATTCGGAC 


2880 


TTGCATCAGT 


ATAATTTTGG 


TTTTGTTTTG 


TTGTCGCATT 


ATCTGCAATA 


CTTTGACGTA 


2940 


ATTTGTCCAT 


TGCTGCATCA 


ACATTGTTTG 


CTTTTTGTTC 


ATTACCTTGT 


GCTTCTGCAA 


3000 


C AG T AGTCGA 


TTGTTGTACC 


AATTGTTTTA 


ATGCCTCTTT 


TTGTGCATTT 


GTTAAATGGC 


3060 


TTAAACCGTC 


AATTGCTGTA 


TTTGCGTGTT 


GTTTCGCTTT 


TTCAAGGTTT 


TGAGTACCAT 


3120 


TAAGCGCTGC 


TTTTGTAGTA 


TTCACTTGAT 


TCATAGCTGC 


TTCAACTTGA 


TC TTTAGGC A 


3180 


CGTTCGTACC 


TGTAGATTTA 


TTTAAAATAT 


TTTCAGCATT 


ACGAACCGCT 


TCATTGTATG 


3240 


CATTTTNCTT 


CTCTGGATCT 


TGCATC TTGC 


AAAGTNNNGG 


ACTGGCAAGT 


GTAGTGTCTT 


3300 


TATCATTCAA 


GCTATTTTTC 


AAGTTANCCA 


TCGCTGTTTC 


AAAATCCTGA 


GNGGGNTTGA 


3360 


TTGGTTGGNG 


TTNACTTCAN 


CTACAG 








3386 



(2) INFORMATION FOR SEQ ID NO: 2 05: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 base pairs 

( B) TYPE: nucleic acid 
3Q (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:205: 
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AAAATACAAG 


CGTCTAAATA 


TTAACAATCT 


NCCTGTGACG 


CTATATGGCA 


T ATC TTGC TT 


60 


TTTAAGCATC 


TGTTGTATCT 


CTGCCATCGA 


GCGCATTGGC 


CAGCTTCACA 


ATAATTATAA 


120 


TGAATTGTTT 


TTATCGATGG 


AGAC AC TAAT 


TGTGTCATTG 


GGTCACCACC 


ANAACCATAT 


180 


ATTTGATGTG 


GTTTCATAAC 


ATCATTTTTA 


TCAGAATAAT 


ATTTATAGGC 


AGC TAATGC A 


240 


ATCGTATCAG 


TTGCTCCAAC 


ANCCGCATCT 


ACTTGCTCCA 


CATTTTCCAA 


AACATTTGCA 


300 


ACATCTTTTT 


GTGCTTCCAC 


ATAAGTAAAA 


TTTGTTTCAT 


GTATATTAGG 


TTTAATTTGG 


360 


TATTTAGCTA 


ACTGGTCCAA 


GTAAACCCAC 


GTTTTCTATG 


AATACCACCT 


GCAAT ATC TT 


420 


TTTCACTTAC 


ACTAAACACT 


TCAACTTGTT 


GATATC CCTG 


TTGACCAATC 


CATTCGCCTA 


480 


TAATTTGACC 


TGCTTTATAA 


TCATCATGCA 


CAATACTATG 


AAGTTGTTCA 


TGTTGTTGAC 


540 


CAACAATAAC 


GATTGGTACA 


TTCATTTTAT 


TAATGACTTC 


AATATGTCTC 


TCTGTTATGT 


600 


CTGTAGCCAT 


TAAAACAATA 


CCATCTACTT 


TACTGCGTGC 


TAATGTTTCA 


AGCGCTTGTA 


660 


TTTCTGCTTC 


GATATTTAAA 


CCTGTGTAAT 


TTAAAATTAA 


TTGTGATTCA 


TATTTTTGGC 


720 


ATTGTTTTGC 


CAATCCTTTG 


ATTGTTTCAT 


CTACTGCATA 


T 




761 



55 

(2) INFORMATION FOR SEQ ID NO: 2 06: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 06: 



CCACCCANCT 


GATTATAATG 


TTTTAGCANG 


AGCTAGAC TT 


GGTTGG TTAC 


CATCATATCC 


60 


ACAATTTAAT 


AAAAATAGTT 


TGTTGTTTGC 


AGAAGAAGCT 


AAAGATGAAG 


GCATTGAGTC 


120 


GAATGAGGCA 


ATTTTAAAAC 


GAGCGATAAA 


TGGAAGTTAA 


GTCAAAACAA 


ACGCAATTTG 


180 


CGATAGAAGA 


TCCGGATTTG 


AAAAAGAATC 


ATCCGGAAAT 


CACTGTTTAT 


ATGGCGCTCA 


240 


AATCTAATCT 


CAAGTTCTGC 


AAAAGGTCAA 


GAATACTTTA 


TGAAGCATTT 


ACTTGGCACA 


300 


AAATCAGGGT 


TATTAGCTAC 


ACCAAATGAA 


GATGAAAAGC 


CAGAAGAAAT 


TACGTGGCGT 


360 


GAGGAAACAA 


CAGGGAAATT 


AGATTTAGTC 


GTTTCTTTAG 


ATTTCAGAAT 


GACAGCAACA 


420 


CCTTTATATT 


CTGACATTGT 


TTTGCCAGCA 


GCGAC TTGGT 


ATGAGAAGCA 


TGATTTGTCA 


480 


TCTACAGATA 


TGCATCCATA 


TGTACATCCT 


TTTAATCCAG 


CTATTGATCC 


ATTATGGGAA 


540 


TCGCGTTCAG 


ACTGGGATAT 


TTATAAAACG 


TTGGCAAAAG 


CATTTTCAGA 


AATGGCAAAA 


600 


GACTATTTAC 


CTGGAACGTT 


TAAAGATGTT 


GTGACAACTC 


CAC TTAGTC A 


TGATACAAAG 


660 


CAAGAAATTT 


CAACACCATA 


CGGCGTAGTG 


AAAGATTGGT 


CGAAGGGTGA 


AATTGAAGCG 


720 


GTACC TGG AC 


GTACAATGCC 


TAACTTTGCA 


ATTGTAGAAC 


GCGACTACAC 


TAAAATTTAC 


780 


GACAAATATG 


TCACGCTTGG 


TCCTGTACTT 


GAAAAAGGGA 


AAGTTGGAGC 


ACATGGTGTA 


840 


AGTTTCGGTG 


TCAGTGAACA 


ATATGAAGAA 


TTAAAAAGTA 


TGTTAGGTAC 


GTGGAGTGAT 


900 


ACAAATGATG 


ATTCTGTGAG 


AGCGAATCGT 


CCGCGTATTG 


ATACAGCACG 


TAATGTAGCA 


960 


GATGCAATAC 


TAAGTATTTC 


ATCTGCTACG 


AATGGTAAAT 


TATCACAAAA 


ATCATATGAA 


1020 


GATCTTGAAG 


AACAAACTGG 


AATGCCGTTA 


AAAGATATTT 


CTAGCGAACG 


TGCTGCTGAG 


1080 


AAAATTCGTT 


TTTAAATATA 


ACTTCACAAC 


CACGAGAAGT 


AATACCGACA 


GCAGTATTCC 


1140 


CAGGTTCAAA 


TAAACAAGGT 


CGACGATATT 


CACCATTTAC 


AACGAATATA 


GAACGTC TAG 


1200 


TACCTTTTAG 


AACATTAACA 


GGACGTCAAA 


GTTATTATGT 


GGATCACGAA 


GTTTTCCAAC 


1260 


AATTTGGGGA 


GAGCTTACCA 


GTATATAAAC 


CGACATTGCC 


GCCAATGGTA 


TTTGGGAATA 


1320 


GAGATAAGAA 


AATTAANGGT 


GGTACAGATG 


CTTTGGTACT 


GCGTTATTTA 


ACGCCTCATG 


1380 


GANAATGGAA 


TAT AC AC TC A 


ATGTATCAAG 


ATAATAAGCA 


TATGTTGACA 


C TATTTAG AG 


1440 


GTGTCCACCG 


G TTTGG AT AT 


CANATGAAGA 


TGCTGNAAAA 


CACGATATCC 


AAGATAATGA 


1500 


TTGGCTAGAA 


GTGTATANCC 


GTAATGGTGT 


TGTAACGGCA 


AGAGCAGTTA 


TTTCGCATCG 


1560 


TATGCCTAAA 


GGTACAATGT 


TTATGTATCA 


TGCACAAGAT 


AAACATATTC 


AAACGCCTGG 


1620 


GTCAGAAATT 


ACAGATACAC 


GTGGTGGTTC 


ACACAACGCG 


CCGACTAGAA 


TCCATTTGAA 


1680 


ACCAACACAA 


CTAGTCGGAG 


GATACGCACA 


AATTAGTTAT 


CACTTTAATT 


ATTATGGACC 


1740 


AATTGGGAAC 


CAAAGGGATT 


TATATGTAGC 


AGTTAGAAAG 


ATGAAGGAGG 


TTAATTGGCT 


1800 


TGAAGATTAA 


AGCGCAAGTT 


GCGATGGTAT 


TAAATTTAGA 


TAAATGCATA 


GGATGCCATA 


1860 


CGTGTAGTGT 


GACATGTAAA 


AACACTTGGA 


CAAATCGTCC 


AGGTGCTGAG 


TAACATGTGG 


1920 


TTCAATAACG 


TAGAAACGAA 


GCCAGGTGTA 


GGGTATCCGA 


AACGTTGGGA 


AGACCAAGAA 


1980 
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CACTACAAAG 


GTGGTTGGGT 


ACTAAANTCG 


TAAAGGGAAA 


CTTGAATTAA 


AATCTGGAAG 


2040 


TAGAATTTCA 


CAAATTGCTT 


TAGGTAAAAT 


TTTTTATAAC 


CCAGATATNC 


CATTAATAAA 


2100 


AGATTATTAT 


GANCCATGGA 


NCTATAATTA 


TGAACATTTA 


ACAACTGCGA 


AATCAGGGAA 


2160 


GCATTCGCCA 


GTTGCTAGAG 


CGTATTCAGA 


AATTACAGGG 


GAT AAC AT TG 


AAATTGAATG 


2220 


GGGACCTAAC 


TGGGAAGATG 


ACTTAGCAGG 


TGGTCATGTT 


ACAGGCCCAA 


AAGATCCTAA 


2280 


CATACACAAA 


ATAGAAGAAG 


AGATTAAATT 


CCAATTTGAC 


GAAACTT TTA 


TGAG 


2334 


(2) INFORMATION FOR SEQ 


ID NO: 207: 








(i) SEQUENCE CHARACTERISTICS: 








(A) 


LENGTH: 2698 base pairs 








(B) 


TYPE: nucleic acid 










(C) 


STRAND EDNESS : single 










(D) 


TOPOLOGY : 


linear 










(ii) 


MOLECULE TYPE: Genomic DNA 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 207: 






GGCACGAGAC 


TAATACCTGA 


AATTAATCCA 


CCACCACCAA 


TAGCGGCAAA 


TAAATAATCA 


60 


AAGTTAACGG 


AATTGTCAGT 


CTTTGATTGT 


TCTAGCATTT 


CTTTAGCAAG 


CGTACCTTGT 


120 


CCAGAAATTG 


TATGAACATT 


ATTGAATGGA 


TCTATAAAGT 


TCATTTGATG 


TTCACTTGTA 


180 


TAAGTTAAAG 


CTTCAGCTAA 


ACAGNGATCA 


AATGTATCAC 


CAGTGAGTAC 


AACTTCAACG 


240 


TTACTATTTC 


CAAAGAACTT 


TAC TTGATTT 


ACCTTTTGTN 


AAGGTGTAGT 


GACTGGCATA 


300 


AAGATAACAG 


CGTTTANATT 


AAGTTTTTTA 


GCTGTATAGG 


CAACACCTTG 


AGCATGATTA 


360 


CCTGCACTCG 


CACATGTAAT 


ACCTTTACTT 


TTAGCTTCAT 


CTGATAAAAC 


AGAAATAGCG 


420 


TTGTAAGCAC 


CTCTTAATTT 


AAAAGAACGT 


ACCCATTGTA 


AATCTTCTCG 


TTTTAAATAA 


480 


ACTTTACAAT 


CATACTTTTG 


AGATAAGTAA 


TGG TCTAATT 


GTAAAGGTGT 


TTCTTTGACA 


540 


ATATCTTTAA 


GTCTTAAAAA 


TGCTTCATCG 


ATATCTTNCG 


TAGAAACTGN 


TGTTTTGACT 


600 


GGCATAATAT 


TCAACTCCCT 


TAAAGTGATT 


TNNC AT ATTN 


NTCTATTAAT 


GATTCATATT 


660 


GTAGGGTGAT 


TGCAATTGTC 


ATC TAAGCC A 


TTTACC AAGT 


TTATTTTTCC 


AAAGTTTCAT 


720 


CAATTTCAAA 


ATGGAAACGC 


TTGTCTGGTG 


ATGACACAGT 


TTGATTTGGT 


AAATCTATTT 


780 


CAATTTCCTC 


ATATTGTGCA 


AGATGTTCAC 


GAGCACTTTT 


TTC TAAANCG 


ATAGGCAACA 


840 


TCGCATTTTT 


AGTGCAATTC 


ATATAAAATA 


TGTCACTGAA 


ACTTCCTGCA 


ATAATAATAT 


900 


GNAAACCATA 


GTCCTTAAGA 


GCCCAAGCAG 


CATGTTCACG 


ACTTGAACCA 


CATC C AAAAT 


960 


TATCTCCAGT 


AATTAAAATA 


GAAGCCCCTT 


TATATTGTGG 


TTTGTTAGGA 


TTGAAATCAG 


1020 


GATTATCTGA 


ACCATCAGGT 


AAGTAC CGCC 


ATTCATCAAA 


AGCAAATGGA 


CCAAAGCCAC 


1080 


TTTTTGAAAT 


ACGCTTTAAG 


TGTACCTTAG 


GAATGATTTG 


GTCTGTATCG 


ATATTGTCGT 


1140 


TGAAGAGAGG 


GACTATTTTA 


CCTTTATATG 


TTGTAATAGG 


TTTGATTGCT 


GCCATTTAAA 


1200 


CAACCACCTT 


TCTTACGTCC 


ACAAATTTAC 


CATGAATAGC 


TGCTGCTGCT 


GCCATAGCAG 


1260 


GGGATACTAA 


ATGTGTTCTT 


GCACCTTTGC 


CTTGTCGTCC 


TTC AAAGTTT 


CGATTAC TTG 


1320 


TAGATGCACA 


ATGTACGCCC 


TCAGGTACTT 


GGTCAGGATT 


CATGCCTAAA 


CACATTGAAC 


1380 


ATCCTGGTTC 


ACGCCATTCA 


AATCCTGCAT 


TTTTAAAGAT 


AGTATCTAGA 


CCTAATTTTT 


1440 


CTGCTTCTTT 


TTTTACTGTA 


CGAGAACCTG 


GTACGACAAT 


AGCTGTAATA 


TTTGGATGAA 


1500 
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CTTTATTTCC 


TTTAACAATA 


TGACTAGCTT 


CAATCAAATC 


TGATAGTCTA 


GCATTTGTAjC 


1560 


ATGAACCGAG 


AAAAACATAC 


CCTAAGTCGA 


TGTCTTCAGC 


TTTTTGACCT 


GGTTCTAACC 


1620 


CCATATAATC 


ATACGCACGT 


TGATCGTTGA 


TATCATTGAT 


TTCAGGGAAT 


GGTTCACTGA 


1680 


AATTAACACC 


CATTTCAGGA 


TTAGTTCCCC 


ATGTCACTTG 


TGGTTCTAAT 


GTTGAAACAT 


1740 


CAAGTTCAAT 


TACACGATCA 


AATATCGCGT 


CGTGCCATCA 


GAATATAGCT 


CACGCCACTT 


1800 


ATCAACTGAT 


TTAGCGAAGT 


TATCGGCAAA 


TGGTCTCCCT 


TT AAC AT ATT 


CAAATGTTAT 


1860 


ATCATCAGGT 


TGGATTATGC 


CGTATTTGGC 


ACCACCTTCG 


ATAGCCATGT 


TACAAATAGT 


1920 


CATTCG AC CG 


TC C ATTG AAA 


GGTTTTTAAT 


TGTTTCGCCA 


GTAAATTCCA 


AAGCATAGCC 


1980 


TGTACCAAAA 


TCAACACCAT 


ACGTTTTAAT 


TAAATGCAGA 


ATAATGTCCT 


T AGC AT AG AC 


2040 


GCC TGTTGGT 


AAGGTACCAT 


TAATATCGAT 


TTTTAAGTTT 


TTGGG TTTTG 


TTTGCC AG 


2100 


CGTTTGAGTT 


GCGAAAACAT 


GTTCAACTTC 


ACTTGTTCCA 


ATTCCAAATG 


CAATAGCACC 


2160 


AAATGCTCCA 


TGTGTTGCTG 


TGTGAGAGTC 


ACCACAAACG 


ATTGTTTTGC 


CGGGC TGTGT 


2220 


AAGTCCTGTC 


TCAGGTCCTA 


CCATGTGAAC 


AATACCTTGT 


TCATCAGAGC 


CC ATATC AAA 


2280 


AATATGCACC 


CCAAAATCTA 


TGGCGTTTTT 


TTGTAATGTT 


GTGATTTGTT 


TGTTTGCAAT 


2340 


TTCATCTTTA 


ATATTGAAAA 


TATCAATAGT 


AGGAACATTG 


TGATCGAGTG 


TTGCAAATGT 


2400 


TAAATCTGGG 


CGTCTTAATT 


TTCTGTTTTG 


AAGCCTAAGT 


CCTTCAAATG 


C TTGAGGAG A 


2460 


AGTAACTTCA 


TGTATAAGGT 


GTAAATCAAT 


GTATAATAGT 


TGCGGGTCGC 


CCNATTTCCC 


2520 


GNATAACACA 


TGTCTGTTCC 


ACACCTTGTC 


AAATAATGTT 


TGACCCAGTT 


ACATCTCCCC 


2580 


CTNAGTGATT 


CAATTTTTGA 


GATAGCATTT 


CGAAAATATC 


AGTAGTATTC 


AATTTGCCGC 


2640 


CTAAATCTGC 


TGTCGTTTGC 


CCATGTTCAA 


TCATGTTATA 


AATATGTTGT 


TCTAATTC 


2698 



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
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CTAAGANGAT 


ANTTTAAGCT 


TGTAACCCAA 


CAGTTCGCAG 


GTTAAATCCA 


ATCCAGTTTG 


60 


ACCAAAGAAA 


TACCGACCCA 


AACCGCCTTT 


GNACCGTAGT 


TGGATAATTA 


ATGGGGACCG 


120 


C CC AAAGAT A 


ATAAGCAGTT 


AGCCGNNNAG 


GTTCCGGGAA 


TAGGNCCGTT 


TAAATTNAAA 


180 


ACCCAGCTGC 


CTTTTGCTTC 


TTGAAATACA 


TTGGGGAATC 


TGGATTAGAT 


GGTCCGAAAC 


240 


TACCGTCGAC 


ACCATCACGT 


CAAGCAGTTG 


CAATNCCAGC 


ATCAGATGTG 


ATTATAAACA 


300 


AAGTGATGAC 


GAGTGATACA 


CCTGTAACAA 


TTGTAGCGAC 


AGGTCCTCTT 


ACGAATGTAG 


360 


CAACGGCATT 


GATTCGTGAT 


CCAAGAATCG 


CTGAGCATAT 


TGAATCTATT 


AC TTTG ATGG 


420 


GTGGTGGTAC 


ATT TGGAAAT 


TGG ACCGC CT 


ACAGCAGAAT 


TCTATCAATA 


CCTACTTCTT 


480 


TCTTGACTAA 


ATCANACTGT 


GGC TTCGTTA 


ACATGCCACT 


TGGTGTGATA 


TAAAAACTAT 


540 


TTTTAAAGTA 


ATAGCTTACA 


GAGTGGTTCA 


AATGTTCAGC 


GAATAAAGCT 


TCATCCATAC 


600 


GTTCTAAGAA 


GAATGGGATA 


AACTCACCCC 


AATGTCCAAT 


AATCATATTT 


AACTTTGGAT 


660 
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AACGATCAAN AATGCCAGAT AATACTAGAT GTATTGCATR RDCTNS 7 06 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 



20 



25 



30 



CTTGTTGGAT 


CAGATCCAAC 


ACTACCTGAT 


GGATCNCTAT 


CNAGATTGAG 


CACGCCACCA 


60 


TATCAGATGG 


AATATCAACC 


TTGTCATGGG 


ACAGACTGAC 


ACCTGAATGA 


NGGGGTGTCT 


120 


GGNCANNCAC 


ATCATCACCA 


TTAATGGGAC 


AGCGAGTGGC 


ACATNTCCAT 


GTAGTAAATG 


180 


GCACTTATTA 


CTTACACGGN 


CATATCGTGN 


CAGGTTGGCA 


AGGTGTGAAA 


AAGACATGTG 


240 


ATACAGCGGA 


AGAGCTTGAC 


ACATATATAA 


AGCANAGTGA 


TGTGGTATAT 


GAGGAACAGA 


300 


AGCATCTANC 


TTTATTTTAA 


GNGGGCGGAA 


ACAATGAAAC 


TCAANGTTAA 


AAGAGANATG 


360 


AGATTAGATG 


AATTAATTAA 


NTGGGCGAGA 


GAAAATCCGG 


ATCTATCACA 


NGGAAAAATA 


420 


TTTTTCNCAA 


CAAGGATTTA 


GTGATGGATT 


CCGTCCGNTT 


TCATCCANAT 


ACAAATAAGT 


480 


GTCTCGACGN 


CAAGTTTTAT 


TCCAATTGAT 


ATCCCCTTCA 


TAGTTGATAT 


TGAAAAAGAA 


540 


GTAACGGAAG 


AGAC TAAGGT 


TGATAGGTTG 


ATTGAATTAT 


TCGAGATTCA 


AGAAGGAGAC 


600 


TATAACTCTA 


CACTATATGA 


GAACACGRRD 


CTNS 






634 



(2) INFORMATION FOR SEQ ID NO: 210: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1157 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 
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GGCACGAGCG 


CCCATTTAAT 


TAATTCATCT 


AATCTCATTT 


CTTTTTTAAC 


TTTGATTTTC 


60 


ATTGTTATAT 


CTCCTCTTGA 


ACAGTAAATT 


TATCGTTAAT 


TGATACGTAT 


CCAGTCACAT 


120 


TACATAAGAT 


GCTATCAACA 


TCAAAAGTCA 


CACAACAGTT 


GCGTTCAACA 


TCATTTGAAT 


180 


AGAATCTTTT 


ATTACCTGAT 


AACTTGGCGG 


TTATCACCAA 


GCCCATTGAA 


TAAATTCAGC 


240 


TAAATCTCAT 


TTCTTTTTCA 


ACTTTGATTT 


TCATTGTTTC 


CGCCCTTTTA 


AAATAAAGTT 


300 


AGTTGCTTCT 


GTTCCTCATA 


TTCCAGAATC 


ACTTTGCTTT 


ATATATGTTT 


CAAGCTCTTC 


360 



176 



EP 0 841 394 A2 



5 



10 



15 



CGCTGTATCA 


AATGTCTTTT 


TC AC AC CTTG 


CCAACCTGGC 


AC GAT ATGAC 


CGTGAAAGTA 


420 


ATAAGTGCCA 


TTTACT AC AT 


GGATATGTGC 


CACTCGTTCG 


TTATCCTGAT 


ACAGATATCT 


480 


CTTAGAGCCG 


AAAAAATGTT 


TTAAGTATTC 


TTTACGTCCG 


CTATCTGTCA 


TGGTCATCAC 


540 


TCCCGCAAGT 


CAAATACTCT 


ATCGACGTAA 


AACTTCGCCT 


TTGCTAAATC 


CTCATGACCA 


600 


TTCTTTAACG 


GTGCTCTAGA 


CAAGTATTTA 


ATTGCATTAC 


CTATTGCGAA 


TGCTAATTGT 


660 


GGTGGGTACT 


GTGCCGTAAC 


TTGTTCAATA 


AAATCTATAA 


TTTCAATGTT 


CGCCGTATGT 


720 


TGTTAATGCG 


CAGGTTGCTT 


AACGTTG TC T 


TGCG TTTTGT 


TC ATATC T AC 


TTTTCTGTTA 


780 


CTGATTATGC 


TCATTATGTT 


TCACTCCATT 


TCTTGAACAT 


TTGGTTATAG 


TTGACATCGA 


840 


ACCAGTACGA 


TCACGTTGAA 


TGTTTTTGAG 


TACATCAAAT 


AATGTTGCTC 


CTTCTCTAGC 


900 


TCAGCTTCTT 


TACGTCGTAG 


CCTAGCCATT 


TCACGCTCTC 


GCTCCAAAGC 


TTTTGTTATT 


960 


TGTATTTCTC 


TATAGTCGTT 


TAGCTTCATG 


CCGAAAGGTG 


CATCAATTGC 


TTCCGACATC 


1020 


TCCCAACCCT 


TCGCAACTCT 


GTTTCTAACT 


ATTTCGGGCG 


TGAGTCCTTT 


CTTTTTCATC 


1080 


TGCTCATTTT 


CATATTCAGT 


GTATTTAGAA 


GGGGGTTTTT 


CTTGTGGTGG 


CGC AATAAGC 


1140 


GCATCGCCCG 


TTAGCCC 










1157 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 725 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 



40 



45 



50 



GGCACGAGCT 


CGTGCCGATT 


CGGCACGAGC 


• TTCGGCTGTA 


TTGGTACCTA 


TCAATACAGA 


60 


CATTGCCGCT 


TTTTCTAAAT 


CTTCAATACT 


ACCTTGAGTC 


ACCCGCCGTG 


TCCAACCGTT 


120 


CTAAATAAGC 


CTTTTGTTGC 


AGGTGCTTGN 


CAATATCTTT 


GNACAAGCTA 


TCAACGTTAT 


180 


TTGTGCCAAT 


AACTTGTCTT 


GCTAATTTTT 


GCANTAAANA 


CGATTCTTCA 


TTCGTCGCTT 


240 


TAGAAGAAGA 


AATGAATGAT 


AGTGCATCTG 


GGCCATACTT 


TTCTTTAATA 


GATGTAAAAT 


300 


TATCTGCAAT 


GACATTTAAA 


GCTTCATCCC 


ATTCTACTTC 


ATGGAACTCA 


CCATTTTTCC 


360 


TTAC TAGTGG 


TTTAGTTAAG 


CGTTGATCTG 


AATTAATATG 


TCCCCATGAA 


AACTTACCTT 


420 


TAACACAAGT 


CGCAATTTTA 


TTTGCTGGAG 


AATCATGTGA 


TGGTTGTACT 


TTTAAAATTT 


480 


CTCTATCTTT 


AGTCCAAACT 


TCAAATGAAC 


AACCCACACC 


ACAATAAGTA 


CACACTGTTT 


540 


TAGTTTTCTT 


AATACGCTCT 


TAACGCAGTT 


CTGCTTCTGA 


ATC TGAGATT 


GCAAATAGTG 


600 


GACCATAACC 


AGGTTC TGCT 


TTTTTAGTTA 


AATCAATCAT 


TGCTGCTAAT 


GAACCAGGTT 


660 


CCGTATCAGT 


CATATAACCC 


GCATTACCTT 


CCAGATTCAC 


TTCCATCATG 


GC ATT AC A TG 


720 


GACAG 












725 



(2) INFORMATION FOR SEQ ID NO:212: 
(i) SEQUENCE CHARACTERISTICS: 
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/A) LENGTH: 1676 base pairs 
(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 

AAAATCCAAG GCACAAAGTT CTAAATACGG AGTCAAAAGA ATATTGAACA AGCACATTTT 60 
?5 CAACAATACT TTGAAAATTT ATTATTTCAT AAGTTCATCA TTGATCCAAA GTAATATTGC 120 

CCTTGCCGAT GTTCATCTTA TTGACTATTT ATTTACAGCC ATTAGTTAGA TATATTTTTG 180 
AGCGAATTGT CATGGCTGTG ATTGTCATCA TTGGTGTTAT TGTCAGTGTG TTTACCATTC 240 
TGTATTTTTC ACCGCTTGAT GCGGCTTATA GCATACTGGG ACACAATGCA ACANAGGCAC 3 00 

2o AGATACATCA ATTCAATGTA TTACATCATC TTAACGAACC TTATTTTATT CAATTGTGGG 3 60 

ATACCATCAA GGGTGTTTTT ACCTTTGACT TAGGTACGAC TTACAAAGGG AATGAGGTTG 4 20 

TGACTAAAGC AGTTGGCGAA AGAATTCCAA TTACAATAAT TGTCGCAGTA TTAGCGCTAA 4 80 

TTGTGGCATT AATTATTGCA ATACCAATTG GTATTATCAG TGCGATGAAG CGAAATAGTT 540 
GGCTTGATAT CACGTTAATG ATAATTGCAT TAATTGGTTT ATCTATTCCA AGTTTCTGGC 600 

25 

AAGGGCTATT ATTCATTTTA GCGNTCTCAT TGAAATTGGA TATTTTGCCA CCATCTTATA 660 
TGCCAGAACA TCCAATATCG GTGGATTTTA CCTGTACTTG TCATTGGAAC AAGTATTGCT 72 0 

GCTTCTATCA CGCGTATTGA CAAGGTCTTC TTGTACTTGA AGGTAATGCG CAGCGATTAT 780 
GTTTTNACTG CTTATGCAAA AGGATTATCG ACGACACAAG TTGGTTATTA AACATATTGT 840 

30 

GGAAAAATGC CATTATTCCA ATTGTACGGT AAGTGGGTCT TCCTNGGTGG CAGAGTTACT 900 
AGGCGGTTCA GCAGTGACGG AACAAGTATT TAACATTAAT GGTATCGGTC GTTATATCGT 960 

CCAAAAACAA CTAATACCTG ATATCCCAGC AGTCATGGGT GGGGTCGTAT ATATATATCA 1020 

ATTGTAATAT CTTTAGCAAA CTTAATTATT GATATATTTT ATGCTTTAAT CGATCCAAAA 1080 

TTACGTAGTG AAATTAACGA AAGGAAGTGA GGCATATGGT AAAACTTACA ACAAAGATAG 1140 

CTTCCTTAAA ACTATTCGCA AGTTATGCTA TAGCAACTTA TATTTTAGTT ATGTTAACGA 1200 

GTGCATTAAA TCTTTTTAAA GGTTATGTGG CCGATACGTT C T AT ATTGC T GAAACATTGC 12 60 

TAATCATTTT AACCATCATT TTAATTATCA TTTTAACAAC GGAACAAACA TGGAAGCATC 1320 

ATGACCTATG GCGACGTATC GTCGAAGTGT TGTTATTGTT GATGGCATTA ACAGGCAACG 1380 

TATTTACATT ATTAATGTGT GTAAGTATTA GACGTTACCA ACGTACATCG CAAATACATA 1440 

GTTATAACGG GGTGGGAATC CGNTTATACG AAAAACT AC T AGACATCCGT ATTTGCGATT 1500 

ATCGGGGTTA CTTATTTTAG TCTACATGCT GACATTATCA ATNGTGTCAC AATTTAAATT 1560 

TGATACGACA ATGGCTTACT AAAAATCCAG TCCAATGAAC TNGTACAATG GACCGAGTCT 1620 

AGCCTATCCG TTTGGTACTG ATGATTTCGG TAGAGACTTA TTTACACGCT CGTGCC 167 6 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1978 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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iD) TOPOLOGY: linear ... .... . 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

CTAATCGTAT GAACGAGTAC CCAGAGGTTT GAAATTTAAT ATTGATTCTA TNAATGATTC 60 

CTTAGTGTCG CATAACGGTG CAAGAGCACG ATAC TTAGG A TCAATAAAAC CTTCGTCAAT 120 

CATATGGTCA ATCATTGNTT GTAGTGGNTT GAAAANGCCA TTAATATTAT AAATGGCAAT 180 

AGGCTTGTCA TGGATACCTA TTTGAGCCCA ACTAATACAT TTCGAAAAAT TCTTCTAGTG 240 

AACCCTGCGC CACCAGGAGC CATGACAAAT GCATCTGCAA GTTCTGCCAT TTTATTTTTA 3 00 

CGTTCATGCA TAGAATCCAA CTAAAATTAA TTCAGTTAAA CGTTGGCTTG TGATTTCATG 3 60 

TTCATCTAAC ATTTTAGGCA TGACGCCAAT AGCTTTGCCG CCATGATTTA ATACACCATC 420 

TTGAATGGCA CCCATAATGC CAATTGACCC TGCACCAAAT ACTAATTCAT AACCTTGTTC 4 80 

AGCAAAATAT TTACCTAAAT CGTATGCTTT TTGTACATAT GAAGGGTCAT GACCTTTGCT 540 

TGCACCACAA TAAACTGCGA TTCGTTTCAT GTTAATCCAG CTCC TTAATT CGATGAATGA 600 

CTTTTAATAG TGATTGTTCA AACACTTTTT GATCTTGCTT TGTAAAAGGT GGGGGACCTT 6 60 

TGTGGCGACC ACCTTGTTTT CTAATTTGTG CATTCATATA TCGTTTATCT AATAGTTGTT 720 

GAATATTTTT GG AATTGTAT ATCTTCCCAT TATGATGCAT GACAATTAAG ACTTTGTCGA 780 

CTAATAAACT TGCGAGTCCA TAATCTTGAG TGACTACGAT ATCATCCTTC GTTGATAATT 840 

GAACAATTTT GTAATCAACT GCATCTGGTC CATCATCAAC ATATAATGTT GATACATGTG 900 

GAGGATATAA TTGGTTCGAA AAATGGCTGA AGCTCCGAAT AATTGTCACA AAAATGCCTG 960 

TCTCAGTTGT TAAATCTATA ATAGAATCNA CAACAGGACA AGCATCTCCA TCAATAATAA 1020 

TATGTGTCAC AATTATGCCT CTGTATTGTT TTCTTTATTT TGTTGAGAGG CGCTTTTGGC 10 80 

AACATAATCT TTATATTTTT TAAATGACTT GATGCGTGCT TTATCAGCTT CTTGTTGGCG 1140 

TTTGTTCTTC TTTGTGTCGT TTTTCAATAT TTTTTTGTAA CTTTTTATTC ATTTTAGCGA 1200 

TTTCTTTGCG ATTTTTTTCA GCTAGTTTAT CGCCTTTTCT CTCAGTTTTC TCATCTAATT 12 60 

TATTAGGTGT TAAGCCTGCT TTTTCTTCGT ATTTTTGTGA TTTTTTCATA TCTTTAATAC 1320 

GTTGTATTTC ATTCTTTTCG CGGGCTTTTT GCTCTTCTTT ATGACGCTTT TCGATATTTT 13 80 

TTTGAAGTAT TTTATTCATT TTATCAGCGT CTTTACGATT TTGTTTAGCT AATTTTTCGC 1440 

CTTTTTTCTC AATATAGGCA GGATCATGTT C TCTAGC AAA CTTTTTAAGT TCACGTTTAT 15 00 

TTTCAAAATC TTGTTTTTTA TC GC CG AC AT ATTCTTTAAC ATCACTCGCT GTGTTACTGA 15 60 

TTGCTGCAGA TGTTTTTGAA GCAACTTTAC TTGGTAGCAT CTGTAACTTT TGGGACGTCC 162 0 

GGATGTTGTT TGATACGTTT ACGTTCAACA ATTAACGGTA CCAATACAAT TGGTAATACA 16 80 

TTAATCATAA ATTTGATGAC TTTTTTCTTA TCCATAGATC TTGCCTCCAT AATTACTTTA 1740 

TTAATTTTAC ATACCCTATG ATACATCAAT ATAAACGATG ATAGTAGTGA NTCACTATTA 1800 

AGTATTTCAG ATGTTTTTTA NAAGAAGACA ATAAAAACTG CCAATCACGT GGTTCCTTAA 1860 

TTGACAGTCT ATATTTTANA CAGGAAATTA AATACCTTTA CCAATGCCAA TCCGAAGTAA 1920 

AGTATAGCAA TANAGATTAC TAATACANTT CTGCTAAATG GCANATGGAA TTAGTCTG 1978 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2558 base pairs 



179 



r 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



5 


\ u i 


TOPOLOGY : 


linear 












( ii ) 


MOLECULE TYPE: Genomic DNA 








10 


(xi ) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 214: 








GGCACGAGCG 


ATGTGCCCAG 


CTTTTTTCAA 


TTGCAATACC 


ATCGGTCCTT 


GATATTGTTC 


60 




TTCTGGTACG 


TCTTTAAGTC 


CTAATGTTAA 


TTCCACTTCT 


TTTGCATCAT 


CTGACCATAA 


120 




1. X X. J. X \J^TLCTUTl 


TC TGTTTC T A 


CTAAATCTTC 


TTGATATACA 


TTTTCTGGTA 


ATGGTGGTAA 


180 


15 


CCATGGACGC 


TTAACTTCTT 


C AATTTC TAA 


TCGTGTTGTA 


ATAGATTCGA 


TATGATCGAT 


240 




AACCGCTTCT 


AAC TCGGTTT 


GGTTTTCTTT 


CGTTTCTTCA 


TCTTCAAGTC 


CACTCAAGTC 


300 




TTTGTTGATT 


GCTTGAAGTT 


GACCATAGTC 


ATTAATCATG 


TAAATCGTCT 


TATCTTCAAC 


360 


20 


TTCTAATTTA 


TCGCCTTCGA 


TGTCATATGT 


TGCACCACTC 


CATGCAGATT 


GGAATAATTC 


420 


ATAAATTTCA 


TT ATT AC C AA 


CTTTGTAAAT 


ACGCACGACC 


TGGTAATGTA 


ATGTCTGCTG 


480 




CATCTGGTGT 


TTTTAAAATT 


TCATTACTGT 


CTTGTCTATC 


TGGNACGAGT 


TNTAATGCCA 


540 




ATTTAAATTN 


AGAGTTAGAC 


CAAATTTGGT 


CATC AAC AAC 


ACCCGATGGT 


TTNTGTGTCG 


600 


25 


CAAGTATTAA 


ATGAATACCT 


AACGAACGTC 


CAATACGTGN 


CGGNGATACA 


AGTTCTNTNC 


660 


ACATAAAATC 


AGGTTGNTCT 


GATTTTAATT 


CGGCAAACTC 


ATCGGAAATA 


ATGAATAAAT 


720 




GTGGCATTGG 


CTCTGNTGCA 


AC ACCTTC TT 


TAAATAACTT 


ATGGTATTGA 


TTAATATGGT 


780 




TAACATCATG 


CTCTCCGAAT 


AAACGTTGAC 


GTTTCTCAAT 


TCGGCTTTGA 


TTGATGTTAA 


840 




GGCACGCATC 


GCTTCATCGC 


CATCTAAGTT 


TGTAATCCGT 


ACCAACTAAA 


TGGACTAAAT 


900 


30 


CTTTAAATAA 


GTTCGCCATA 


CCCCCACCTT 


TATAGTCAAT 


CAATAGGAAC 


GCAACTTCAT 


960 




GTGGATGAAA 


ATTAATAGCT 


AAAGATAAAA 


TGTATGATTG 


m TV T\ fTV ''IV' TV 

GATAATCTLA 


AGATTTCCCT 


1020 




GAACCAGTGG 


TACCAGCAAC 


TAAACCATGT 


GGCCCGTGTG 


CTTTTTCATG 


TAAGTTCAAT 


1080 




GATAAAATAT 


CATC T TT ACC 


TCTTACACCT 


AAAGGTACTG 


CCATCGTTTT 


GTATGTTTCG 


1140 


35 


TTTTGTCTCC 


ATCGATTAAC 


CACATCAAGC 


TGATCTACTT 


CTTTCACGTT 


ATACATCTCT 


1200 




AAAAATGTAA 


TACTATCAGG 


AATTGCATTT 


TTCAAATGTT 


CGACGTGTAT 


CAAATTCGCC 


1260 




AAACGTCTCG 


CGATGATATT 


CTTTATCAAC 


GTTATCAATA 


TTTTCAGGTG 


TAAATTTCAA 


1320 




TTGAACTAAT 


TCTTTTTCTT 


TCGTAATCAG 


TTCGCCTTCA 


GTACGAGACT 


TGATATCAAT 


1380 


40 


AATGGTATCT 


ACATGCTCTG 


GCAAACTTTC 


AATCACATCT 


TCAACAAAGA 


TTAATGAAAT 


1440 




ACCATATTCT 


G ATNAATC TT 


GGNTNAC AT A 


GTCTAAAATG 


ACATGATCAA 


TAATTAATGA 


1500 




CATATCTGTA 


ATGACAAACA 


CTAATTGCGG 


TGTGAAAATA 


ATTTGCTCAT 


TACTTCTGCT 


1560 




GCGTTCACGC 


ACAGCTTGGA 


TACGTTCTTT 


AATCATGCTA 


TAAATTGACG 


TTAAAATTTG 


1620 


45 


GTCACGTGTT 


CGGTGATTGT 


AAACAAACCC 


TCTAATGTTT 


TGACCTCTCA 


ATGTCATATG 


1680 




TGGCAACCAA 


CGTGCCCATT 


TCAATGTTTC 


AACTTCATCT 


TCACGTGTCA 


CAAATAGAAA 


1740 




CTCTAAATCA 


TGATAACTAT 


GGAACGTTGA 


CAATTGGATT 


AGCATTTTCT 


CCAATTCTTC 


1800 




TAAAATGAGA 


TGTCGGGCAC 


CAATATATGC 


AATTGGTCCA 


TGATTTAAAT 


CATTGATTAA 


1860 


SO 


TGGGTGCTTT 


GTCCNACANC 


TNTGANAAAT 


TCATACAATT 


CTTTAGCATC 


GTCGAATAGT 


1920 




TCATCACGAC 


GTTGGTTAAA 


TTCTCCNTCT 


TGGTAATCTA 


ATTTGAATGA 


CTTTTCTACA 


1980 




TTCGCAATAC 


CTAACTTATA 


ATGTAAGAAA 


TCGTGATGAT 


GCGATGTTTT 


TTCATATATT 


2040 




CTTGGTGCTT 


TCGTTTCAAC 


GATATCTTTA 


ATTTCANCAA 


CCGTTGGATA 


ATGGTAATTC 


2100 


55 


AAACTAAAAC 


GTTGTGCTTT 


AATCGCTTTA 


TT AATTTC TT 


TAGATTTATT 


ATCCAAATAA 


2160 



180 
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GCTTTGTAAT CTTTCTCTCG TTTTTCAACA TCTTTGTTAT ACTTTTTCTT TTCAGAGAAA 2220 

TACGTTGTAA TACCAAATAC T ATCGTT AC T GTACTCATAC CAATCATCAT TAAAATATAA 2280 

ATACCAATTG GTCTCACTAA AAAGATGACA ACAGTTAAAG CAATCATTAC TAATGGCGGT 2 340 

ATAATGGAAC GCCATATCAC TGTATTCCGG AATTCCGGAA TTC CGTATTG GCTGTGGCGG 2 400 

TCTTTCAATC TTAATATCAT CGGTCGGGTC NCGGTGNATA ATC CTTGGCG AGCGATGGTA 2460 

CGTATTGTAA TCATCTGCCT GTGCATGTGG CATCTCTTGT GTTAAGCGAA TTAATGACGA 252 0 

TGCCACTGTA TTCTGACTCA ATACANTTAA ACCATCAG 2558 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 



CCGGCTGATT ATTAGCGAAG CGAGATTTTG AAGAGATGCA ACCATTATAT GAATGGTTAG 60 

GTTTAACTGC TTCATTAGGT TTTGTTGATA TTGTGGATTA TGAGTATCAA AAAGGGAAAA 120 

ACGTAATATA TATGAACATG ATATTATAAA TACAACTAAT GGACGTCTCG GTTTTGATTA 180 

30 TTTAATTGAT AATTTGGCTG ATAGTGCGGA AGGGAAATTT TTACCACAAT TAAATTACGG 240 

TATTATTGAT GAAGTGGATT CAATCATTTT AGATGCTGCT CAAACACCAT TAGTTATTTC 3 00 

GGGTGCACCA AGATTACAAT CAAATCTATT TCATATTGTG AAAGAGTTTG TAGATACATT 360 

GATTGAAGAC GTGCATTTTA AAATGAAGAA GACCAAAAAA GAAATATGGC TGTTAAATCA 420 

35 AGGTATTGAA GCGGCACAAT CATACTTTAA TGTTGNAGAT TTATATAGCG AACAAGCGAT 480 

GGTCCTAGTG CGTAATATTA ATTTAGCACT GCGCTCGTGC CGAATGTTCC TTGGTTCATA 540 

TAAGTCATAT CGATGTAAAA ATGTTCTTAA ATTTGGGTGC ATGATTGGAG AACAACCAGG 600 

TGGATAATCT TAGTCATTTT CAAGGGTGCA TTCCCATTAA ACTTATCATA TCGGCAAATT 660 

40 CCCGCTTT 



(2) INFORMATION FOR SEQ ID NO: 216: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 
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CCCAAATCCT TTGAAATAAA ACTTAATTAA GTTCAAGAAN TCGNGGTGAT TTTCCCCGGA 60 

AGTTTGTTTG AATATTATTA TGGATTGATT TGATAGAGCT TATGGACCCC ACATTAATAT 120 

TGAAGTCCTG GGGCCAAGTT CACNCCCCCG TGGGTGG AAT TTCCTATTTG ATCCCTAAAC 180 

CCAAACTATG GACGTCCCAA ATTCTAAATA TGAAAAAATG GCTGAGCATC GTTTGATGAA 240 

TCATGATTTA TATCCCGAAA AAATAGATAA TCGTTAAATG TATCATTTAA T AAAC AC AC C 300 

AATAAGTTGA TTTTCCTAAC TTATTGGTGT GTGTTTTTCA TTTAGCATAC ATAATAGGTT 360 

ACATTAAAAT AACATTTTAT ACCCAAAGTA CACCAAAAGA ATATTAGTAC AC G AATT ANA 420 

CAACATTTTT ATAGAAACCT ATTGCACTTT AACGTCAATA AGTATATTTT TATATTATCT 480 

CTAATTAATT GTGCGCGCTT AATAACAGAA TATTCTCAAT ATTTTTATTT TTTTGTGATT 540 

TGTTGGAATA TTT AGTTGAT AAGGCACAAT CAAATTTACT TAAACTATTG TATTAGGGGA 600 

AGAAAGGATG GGATGTATAC ATGACACAAC AAAACTCCCA TGGAAATCAA ATTCAAGACA 660 

TACCTCAAAC AGGATTTTTC GGGCATCCTC GAGGACTAGG CGTACTCTTC TTTGTAGAGT 720 

TCTGGGAAAG GTTTAGTTAT TATGGCATGC GTGCCCTACT CATTTTCTAC ATGTACTTTG 7 80 

CCGTAACAGA TAATGGCCTT GGAATTGATA AAACAACAGC TATGTCAATT ATGTC AGTTT 840 

ATGGTTCATT AATCTATATG ACATCCAATA CCAGGCGGAA TGGATTGCTG ACAGAATTAC 900 

AGGCACTAGA AGCGCTACTT TATTAGGTGC AGTCTTTATT ATTATCCGAC ATATTTGTTT 960 

AAGCTTACCA TTTGCATTAA TCCGCTTATT CACATCAATG TTCTNCATCA TTATTGGCTC 1020 

AGGTTTAATG AAGCCAAACA TTCCAAATAT CGGTGGCCGT TTATATCCTG AAAATGATAG 1080 

ACGTATGGAT GCAGGTTGTG TTATTTTCTA TATGTCAGTT AATATGGGTG CATTATTATC 1140 

ACCTATTATT G TGC AAC ACT GTGTTAATGT TAAAAACTTC CACGGCGGAT TCTTGATTGC 1200 

AGCAGTTGGT ATGGCATTAG GTTTAGTATG GTATGTACTT NACANCCGCA AAAACTTAGG 12 60 

TAGCGGTGGT ATTGAAACCN ACTAACCCAT TGACNACCAG CTTGNAAAGA AAAAGTATGG 1320 

TCTTTATTAT CGGAAGTGGT GTCTTTAGCA ANTGTATTAA TTATCGGTAT TGGGGCATTA 13 80 

ACTAACTCAA TATCAANTAA CTTAGGTAGG AATAC TGTTT TAGTATTAGG TATTGCATTA 1440 

CCANTCATTT AC TTG ACTTT AATTATTAGA AGTANAGATG TCANNGATAC TGNACGTTCT 1500 

CGTGTTAAAG CATTTATTCC ATTATTTATT CTTGGAATGG TGTTCTGGGC TATTCAAGAA 1560 

CAAGGGTCTA ATGTATTAAA CATATATGGA ANTGNACATT CAGATATGAA ATTAAACTTA 1620 

TTTGGTTGGA AAAC AN AC TT TGGTGAAGCG ATCTTCCAAT CAATTAACCC ATTATTTATT 1680 

TTATTATTAG C TC C AATT AT TTCACTTTTA TGGCAAAAGC NTGGAACTAA ACAACCTAGC 1740 

CTGCCAGTAA AATT TGC AAT TGGTACGTTC TTAGCAGGTG CGTCATACAT ACTAATTGGT 1800 

40 ATTGTAGGTT ATGCATCAGG TTCATCAAAC TTCTCAGTTA ACTGGGTTAT TTTATCGTAC 1860 

ATTATTTGTG TTATCGGTGA GCTATGCTTA TCACCAACGG GTAATAGTGC TGCTGTTAAG 1920 

TTAGCACCTA AAGCATTTAA TGCCCAAATG ATGAGTATTT GGTACTTAAC TAACGCTTCT 1980 

GCACAAGCAA TCAATGGTAC TTTAGTTAAG TTAATAGAAC CACTAGGTCA AACAAATTAC 2040 

45 TTTATTTTCC TAGGTGTTGT TGCAATTATT GTTACAACAA TTTGTATTAG CATTCTCACC 2100 

TTTAATCATC AAAGC GATGA AAGGTATACG TTAATATTGT TGGCCTAATT CAAAAAACAG 2160 

TAAGTCATTT AAATGGCTTA CTGTGTTTTT ATAGGTTTCT ATTAATTAAA TTCAAGATAT 2220 

CAGTATAAAT AAAAGCTTAA TATGCTCGTT ATAGACAGCT ATAACTATAT TTTCTCGTCC 2280 

50 CACTCTATAG ACATCAATGT CAGTTATTAC CATTTTCTCT ATTTAAAAAC ATATTTTTGG 2340 

TATTAACAAT TAAAAGTGGG TATATATATT TAATGAACCA ATT T TT AG AA AGTAGGCTAA 2400 

ACTATGGAAA ATAAATATAC ACATGGTGTC CTCTTTTACC ATGAACACAG TGGATTAAAA 2460 

AATATTAATC AAGGTATTGG GGAAGTTACA ACAGCATTAA GTTCAATTTG TAAGCATCTT 2520 

55 TCTATTCAAT TGAGTGAAAA CGAAGGTGAC ATTATTAAAT ATTGCCAAGA AATCAAAACT 2580 



35 
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AAAAACTATC CAAAAGATGT AGATAG _ 2606 

(2) INFORMATION FOR SEQ ID NO: 2 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



CTACGATGGC 


ATCTAAATCT 


GCGGTGTCGT 


CAATAATCAG 


TGCTATTGNC 


TGGTGGNCCT 


60 


GCNATGTGAG 


CAATACGCTA 


CCTGTCCAAA 


TAAANATTTT 


GNGGNAGATG 


CAACAAATTG 


120 


GGTACCTGGA 


CCNACAATCT 


TATCAACTTT 


AGGTATCGTN 


TCTGCNCCAT 


AAGTCAATGN 


180 


AGCAATACTN 


TGAGCACCAC 


CNNCTNGANA 


CACTTGAGTA 


NCTTGCGTAA 


GATAACATGC 


240 


AGCTAATACC 


TCTTGGGATA 


CTCCGTTAGG 


GTGAGG TGGT 


GTCACANCAN 


CAATATTTTC 


300 


TACACCCGCT 


ACTTGTGCTA 


AAGTACNCTG 


TCATTAGANC 


CGTTQATGGA 


TAACTTGCTT 


360 


TGCCACG 












367 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 





GGCACGAGCG 


ATATTTGTCA 


GGAAGTTAAA 


GTTCATGGAG 


AT AAGGC TTT 


GAAAATGTAT 


60 


45 


AATCTAACAT 


TCGATCATAC 


GAAAACAGAT 


CATTTAGAAA 


TTAGTCATGA 


ACAAATTAAA 


120 




GCAGCATTTG 


ACACATTAGA 


TGAAAAAACA 


AAACAAGCAT 


TACAACAAAG 


TTATGAAAGA 


180 




ATTAAAGCAT 


ATCAAGAAAG 


TATTAAGCAG 


ACGAATCAAC 


AGTTAGAAGA 


ATCAGTGGAG 


240 




TGTTATGAAA 


TATACCATCC 


AGTAGAAAGT 


GTGGGTATTT 


ATGTGC CTGG 


TGGCAAAGCA 


300 


50 


AGTTATCCAT 


CAACGGTTCT 


AATGACAGCG 


ACTTTAGCAC 


AAGTAGCGGG 


TGTAGAAAAT 


360 




ATTGTTGTTG 


TGACACCACC 


TCAACCTAAC 


GGAGTATCCC 


AAGAGGGTAT 


TAGCTGCATG 


420 




TTATATTACG 


CAAGTTAATC 


AAGTGTTTCA 


AGTTGGTGGT 


GCTCAAAGTA 


TTGCTGCATT 


480 




GACTTATGGA 


ACAGAAACGA 


TACCTAAAGT 


TGATAAGATT 


GTAGGTCCAG 


GTAACCAATT 


540 


55 


TGGTNGCATA 


TGCCCAAAAA 


ATATTTATTT 


GGACAGGTAG 


GTATTGTTCC 


ANATTGCAGG 


600 
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ACCANCAGAA 


ATAGCACTTG 


ATTATTGACG 


ACACCGCCAG 


ATGTAGATGC 


CANCGNCTflJT 


660 


NATGTTTTTG 


GGCAAGCNAG 


AACATNATGA 


ATTAGNACGT 


CCATNTNTCA 


TTGGTGAAGA 


720 


TGCGCAAGTC 


C TN AAAG ATT 


TNGAATCACN 


TATTGCTAAN 


GCATTGCC TA 


ATGTGGACAG 


780 


ANACGACATT 


GTTTCTANAA 


GTATCGCNAA 


TCAACACNAC 


CTNATCCANN 


ATNGTAATTN 


840 








rl 1 /\L./\rl J. \.uL 




UCGTCGATTC 


q n a 


AAACAGTAAA 


TCCTCAACCA 


GGAATTCTTG 


TTGAACGAAT 


AATTTCTATC 


TCGTTATCAT 


960 


CGTAAGCGTC 


ATTATCAACT 


TGTGTTTCTT 


GCATTTCTTG 


TAATTCGGCA 


AC AAAC AC TT 


1020 


CTTGATCTCC 


TCGATCACGG 


CTCTTACGAT 


TAATACGTGT 


TTTATATTTT 


CGAACTTGTC 


1080 


TTTCAAGTTT 


ATTATTAATT 


AAATCAATAC 


CTGCGTATAA 


ATC ATCGTTT 


CGCTCTTCAG 


1140 


CTCTTAACGT 


AACATTTTTC 


AATGGAATTG 


TTACTTCAAT 


TTTAGTAGCT 


GAATNTGAAT 


1200 


AAGTTTTAAC 


TTTAACATGC 


GCCACTGCAT 


NTGGTACGTC 


ATTAAAATAA 


CGTTCCAACT 


1260 


TACCAATTTN 


CTCCTCAAAT 


ATAGNTGCGA 


ATAGCATCTT 


GTGATAGGGG 


AGGGNTATCT 


1320 


CCAATGAATT 


TNCAAATCTA 


TATCATANGG 


GAATCTCTCC 


CTTAAAC CTC 


TTTATTGGTA 


1380 


ACTCTTTATT 


ATATTNNNAC 


ATTNNNACGC 


TATCGNGCAA 


ACGCAAACAC 


TTTGGATTCT 


1440 


CTGATATTNT 


NNAGCATATT 


AATTTACAAC 


CCTGCACGAT 


GATTG 




1485 



(2) INFORMATION FOR SEQ ID NO: 219: 
(i) SEQUENCE CHARACTERISTICS: 

25 

(A) LENGTH: 1514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

35 



40 



45 



50 



55 



ATGATGTAAA 


CCCAGTTCCN 


ANCTTCATCA 


TTCAGTTGAT 


GCAATAATGT 


CAC NNAAACT 


60 


CTACCACCAA 


GATGCACCAA 


TAGGTTGTCC 


CTAAATGCAA 


TAAGCGCCAC 


CCTTTAACAA 


120 


TTCACCCTTT 


TCAGGGAGGT 


NATTTTAAAT 


TCACCGATCC 


AACAGCTAAT 


GATTGTCCTN 


180 


CAAATGCTTC 


AATTTAATTC 


CAAATACATC 


AATATCGCTT 


AATTCTTTTT 


TACTACGTTT 


240 


CCAAAGCCTT 


TTCTACAGCG 


CCAACTGGTG 


CAATACCCAT 


AATAGAAGGA 


TCTACGCCAT 


300 


GACTTCCCAA 


AGCCATCCAA 


GCACTGCCAA 


TGGTTCGATA 


TTTAATTCTT 


TAGCTTTGTC 


360 


TTCTGACATG 


ACTAACATCA 


TCGCAGCACC 


ATCATTGATT 


CCTGATGCAT 


TACCTNCTGT 


420 


AACTGTCCCG 


TCTTTTTTGA 


AAGCTGGTCT 


TAAGCGACTT 


AATTTTTCGA 


C TGAT AC ATT 


480 


TTCACGGACA 


CCTTCATCCT 


TAGTGACTAA 


GATTGGTTCA 


CCTTTACGTT 


GAGGAATCGA 


540 


TACTGGAACT 


ATTTCACTAT 


CAAATTCACC 


ATTTTGCTGT 


GCACGTACTG 


CTTTATTGTT 


600 


GTGAGATTAC 


CAGCAACATG 


T ATC TTTGTT 


CTTCTCTTGA 


AATACCATAT 


TGCTCCTACT 


660 


AAATTTTCAG 


CAGTAATACC 


CATATGATAT 


TGAGTAAATA 


CATCTGTTAA 


ACCATCATAT 


720 


ACCATGCTAT 


CAACCATTGA 


GTGATGTCCC 


ATTTTAAAAC 


CGAAGCGACT 


G TTGTTGNC A 


780 


AGCATTGGTG 


NCTGAGACAT 


ATTCTCCATA 


CCGCCAGCGA 


GCACGATGCT 


CATGCCCACC 


840 


AGTCAGCNAT 


AGATTGATAT 


GCTAATTGAN 


TCGACTTTAA 


CCCAGAGCCC 


ACATACTTTA 


900 


TTCACTGTAA 


ATGCAGGTGC 


TGTTTCTGGC 


AAGCCACCTT 


TCATAGCAGC 


AATTCGTGCT 


960 
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GGATTTTGTC 


CTTGTCCTGC 


TTGTAGTACG 


TTACCGATGA 


TAACTTCATC 


AATCTCACTT 


1020 


GGATTCAAAC 


CCGTCTCTTT 


AATAATATGT 


TCTATTAAAG 


TCGCACCTAA 


ATCATAGGCT 


1080 


GGCACGTCTT 


TAAACGCACC 


TCCAAAAACG 


CCAATAGGTG 


TCCTGTATGC 


TGCTGCTAAT 


1140 


ACGAC TCTCG 


TCATGTCATA 


TTCATCTCCT 


TATGATATAT 


ACAATTTTAT 


TACTAATACG 


1200 


TCTAATGAAC 


CAATACGCCA 


CTAACATCCA 


ATTTATGAAA 


CCAATTGCTA 


C GTATGTT AC 


1260 


ATTTTCATTC 


TTGTTATGCA 


TTTAACAAAT 


CATATTATAC 


AATGATGTAA 


TTTG TTTGGG 


1320 


CAATCGGAAA 


CGTAAACGGT 


TTCACTTTAA 


CATTTCACAC 


TCTTAAATAT 


GCTCTATCAT 


1380 


CCTCATCTAT 


AGTACAAAAT 


TATGCTTTGA 


TTTACCACAA 


ATTATCATTA 


ATCATTTCGC 


1440 


TCGTGCCTCT 


AATTAAATTC 


TTTCTATCAT 


TGGATCGNTA 


TCACAGGTGG 


NTTCACCACA 


1500 


ATTCTCATCA 


ATTT 










1514 



IS 

(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1000 base pairs 

20 

(B) TYPE: nucleic acxd 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



35 



40 



45 



SO 



GGCACGAGCT 


CAAGTAAAGT 


GCTACCATTT 


TTC TTAC AG A 


CATTTGCCGA 


NATCTTAAAC 


60 


TAAGACATAN 


CCCCTAAAAT 


ACTCNCTANA 


GNACCGNACA 


ACCNCAACAC 


AATTCCTGTC 


120 


ATGATC TT AG 


GCTTTGTGCC 


ACCTNCCCCA 


GTTGNCAAAT 


GCTTTGTCCA 


TCCTTCTTGT 


180 


AATGACTGAA 


ATCCTTCTTG 


ATACATNCGA 


AATGCAACAA 


ATGGAAATCC 


TTCATAAACT 


240 


GTTACGGGCA 


ATGATTGTGA 


AGTAGATGCA 


CTTCCTAAAG 


CAAATCCTTC 


AATAAAAATG 


300 


ACGGGTCGCA 


CTTTTATGAC 


CTCCAGTTGC 


ATAATAATCT 


TCTTTATTTG 


TTAATGTCAC 


360 


CGGTCCAAAT 


GCATGCTGGT 


TAGTCCGCAC 


CGTCGTCTAA 


GGTANAAAAT 


ACATTCATAC 


420 


CACCGACTTG 


TCATTAAATT 


AAATATCGCT 


GAAAACCCTT 


CGTAGAAACG 


CTTTGTTATA 


480 


TGATAAGGCT 


GTACGCTTAA 


CAATCCTTTT 


TCACCTTGTA 


ATTGAT AC TG 


ATTAATCAAC 


540 


GTTTCAACAG 


CATCTTTCCT 


TAAGAAAGTT 


ACATCAGCAT 


CTACAAAGGC 


AATACGATTC 


600 


GTACATGCAT 


GCGTCACACC 


TTGATAACAA 


GCATGTGATT 


TCCCCATACC 


ATTTGCCGTC 


660 


AGTATCATCA 


ACAACATCTA 


CTACAGTCGC 


ACCATATGAA 


CGTGC TAC AC 


GTGCTGTTTC 


720 


ATC TGTCG AT 


CCGTCATTCA 


TAACAATGAC 


ATCTACTGGA 


ACTTGC TGTT 


GTATTATCGA 


780 


ATGTAGTAAA TGACCTATTC 


TTTTTTCTTC 


ATTTCTAGCC 


GGAATAATAA 


TTGTTAATGC 


840 


TTTATGATTG 


AAGTTCAGCG 


TTTTCGTC TT 


TAGCTGATGT 


CTACGATTAA 


ATATCAATGC 


900 


ACCACACGCC 


ATAGACATGG 


TCACTATTAC 


TGTTAATATT 


CGTGATAACC 


ATTTCATAGA 


960 


TATCACTTCC 


TATTCTTCGC 


TTC TCGCGCG 


AGCCCGNGCC 






1000 



(2) INFORMATION FOR SEQ ID NO:221: 
(i) SEQUENCE CHARACTERISTICS: 



185 
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(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



15 



20 



25 



30 



GGCACGAGCT 


ATATCCAATT 


GGGGTAAAGC 


GGGTAAAGAC 


CAAGAGAAGA 


ATTTAAGAAG 


60 


ACATTAGCAG 


AAATTGAAAG 


GAC AC CGGC A 


CGAGC TAGCG 


CAACAAGTTT 


AGCGATTGAA 


120 


GCATTTGGTG 


CAAAAGCAGG 


TCCTGATTTA 


GCAGATGCTA 


TTAAAGGCGG 


TCGC TTTAGT 


180 


TACCAAGAGT 


TCTTAAAAAC 


TATCGAAGAT 


TCGCAAGGAA 


CGGTCAATCA 


GACATTTAAA 


240 


GATTCTGAAA 


GTGGCTCCGA 


AAGATTTAAA 


GTAGCAATGA 


ATAAACTTAA 


ATTAGTAGGT 


300 


GCTGATGTAT 


GGGCTTCTAT 


TGAAAGTGCG 


TTTGCTCCAG 


TCATGGAAGA 


ATTAATCAAA 


360 


AAGCTATCTG 


TAGCAGTTGA 


TTGGTTTTCA 


AGTTTAAGTG 


ATGGATTTAA 


AAGGTCGATT 


420 


GTTATATTCG 


GTGGTATTGC 


TGCTGCAATT 


GGCCCTGTAG 


TTTTTGGGAT 


TAGGTGCATT 


480 


CATAAGCACA 


ATTGGCAACG 


CAGTAACTGT 


TTTAGCTCCA 


TTATTAGCTA 


GTATTGTAAA 


540 


GGCTGGCGGA 


TTGATTAGTT 


TTTTATCAAC 


TAAAGTGCCT 


ATTTTAGGAA 


CAGTCTTCAC 


600 


AGCATTAACT 


GGTCCAATTG 


GTATCGTGTT 


AGGTGTACTG 


GCTGGTTTAG 


CAGTCGCATT 


660 


TACAATAGCT 


TATAAGAAAT 


CTGAAACATT 


CAGAAATTGT 


GTTAATGGTG 


CAATTAACAG 


720 


TGTTAAACAA 


ACGTTTAGTA 


ATTTCATTCA 


ATTTATCCAA 


CCTTACATTG 


ATTC CGTTAA 


780 


AAACGTCTTT 


AAACAAGCGG 


TTTCAGCAAT 


CGGTGATTTC 


GCTAAAGATA 


TTTGGAGTCA 


840 


AATTAATGGA 


TTCTTTAATG 


AAAACGGAAT 


CTCTATTGCA 


CAAGCGCTTC 


AAAATATATG 


900 


CAATTTTATC 


AAAGCTATAT 


TTGAATTTAT 


CTTAAAATTT 


TGTAATTAAA 


CCAATCATGT 


960 


GTGCGATTTG 


GCAAGTGATG 


CAATTTATTT 


CGCCGGCGGC 


TAAAGCCTTG 


ATCGTCAAGT 


1020 


ACTTGGGAGA 


ATATAAAAGA 


GTAATAC 








1047 



35 

(2) INFORMATION FOR SEQ ID NO:222: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

GGCACGAGCG GTACGAGTTC ACTGTTAATC TGCACTTCAA TGTGATTATG ATCTGAGTTA 
TCCTGCATTT CGTAACCGAT AGCGATGTTT GTTACTTTTT CAAAATTAAT ATCACTATTG 
TTCGCTAAGC TTGAACGTAC AGACTCTAAT TTCGGCAACG TTTTATTATC CTCACTATTT 
AACACAACGT CGGTGCGCAA TAACGAACGA CGATAATCAA AGACGCCTTT TTCACCCCAA 



186 
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5 



10 



15 



20 



25 



GTGACTTGAA 


TTTGATT AG A 


GPCTTCTTTA 


TT AAACG TTG 


GATAACCGTT 


AAGGAAACGT 


300 


TGATATGTTA 


ACTCGCCTGA 


C TG ATGGC AT 


TCGTACTAAA 


C AATC TGAAG 


TCTTCGTTTA 


360 


AGAAACC AC C 


ATGAC C ATTA 


ATAAAATCAA 


AGGTTCCTGG 


AATCGTTTCT 


TCCATTGTGC 


420 


TGGAACTTGC 


TTCATCTTCG 


GACAGGTTTT 


TATAATGATA 


TTTCNCATTT 


TTATCGTTAT 


480 


AGTGTGNGAC 


ACCTGTATTA 


TTGNTGTAGG 


TTGTAACACC 


ACTCTNTGAN 


CTACGAACGA 


540 


TGGNTGANTC 


GACAAATAGT 


ATAGCATNCA 


TTTTCTCAAC 


ACTAATCGTG 


TTAAATAC C A 


600 


TGCGATATGT 


TTTTANCTTN 


NCAGGTTTAC 


CTTGGGTGCA 


AAAACATGCG 


TCGNACGATC 


660 


AATTGTATCT 


NTGTGTGTGA 


TGATATCTTG 


TGTATGGGTT 


GCATATCTTT 


TCACACTGCT 


720 


GCTTAATGCA 


TCTTAAAAAA 


TGNATCATTT 


CGCGGTGG T A 


GNTGNTTAAT 


TTTACGGTAA 


780 


TCGGGGCGGT 


CTTTACTTAT 


AGCATAAAGC 


ACGATATTAT 


CATCAGCATC 


ATGATCTATG 


840 


ACTAAACGAT 


TGAAATTGAA 


ATGATTTGGT 


ACTTTCGCAT 


TCAATGTTCA 


NGAACTTGGC 


900 


P A A CI A T A TVTT 


TV A T A A CfZTW 


AAA * T, r* AAA CCZ 


TA A AATTT^ A A 




TV* Ar-TYIPTT A 


j o \j 


AATCCGGAAT 


CATCAAGTTA 


TGATCACGAC 


GAACATGTTC 


CACGGACNTC 


ACTTCTTTAT 


1020 


TTTTCAAAGG 


TTTCGTCAGT 


TTATTCACAT 


TAG ATAC CGT 


CGCAATCGTA 


CCTTCTGGAT 


1080 


GATCATTTTT 


CGAATGAATA 


ATCTGAAATG 


GCGTAATAGT 


TGTATCCATT 


TTGGCTGTCA 


1140 


TAGGTGTCGT 


TAAAGGTTTC 


GTTTCACTCT 


TCTTACTATC 


TGTATTGTCG 


ACATTTGCAA 


1200 


TATCAGGAGA 


AAAGTTCCAT 


ACCATATATG 


TCAATACGAC 


ACTCATCAAG 


ACGAGTAGCG 


1260 


CTAAAATGAC 


AGATTTAATA 


TGTTCCTTAT 


TATTCATCCC 


AATCACCGTC 


TTCAATGACT 


1320 


TCACATGGAA 


GTGTGATAAA 


GATAGATGTA 


CCTTGAC CTT 


CTACACTGTT 


TGCCCAAATA 


1380 


CGACCATTGT 


GCGCTTCCAC 


AATCTCTTTC 


GAAATGGCTA 


GTCCTAATCC 


AGTACCACCC 


1440 


ATTTTACGCT 


CGTGCCGTNC 


CTTATCTACA 


CGATAGAATC 


GGGNCGGAAT 


ATCTTTATCG 


1500 


ACTTTATTGG 


ATAGGAATTC 


CCGATG 








1526 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



50 



55 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:223: 






GGCACGAGCG 


AGCAATATCA 


AATAATAAAT 


CATCATCAAT 


AGTTCTTCCT 


TGGAAACGTT 


60 


CAGGGATGAT 


GTTGAATAAT 


AATGACGTAT 


CTAATATATG 


GTCATATAAA 


GAGAAATCAC 


120 


CAACTGGGAT 


GC TATCT AAG 


TGATAGTACT 


TTTGTAATAA 


TAAATTTTCT 


TTATGTAGAT 


180 


CAGGNAATGN 


ATGCATCTAN 


CNACTCNTCN 


TTAGANNTCN 


TCNTTGCCCA 


ANAACTTTCG 


240 


ATGGCTTTTC 


TCCATTCCCT 


TTCTCNACCT 


AATCTTGGGA 


ANCCTAAGTT 


TGATGTTTTA 


300 


ATTGTTGACA 


TAATATTGCC 


TCCTTGTGAG 


CAGGTAATAG 


ATTTNGAGTA 


TGCTGCAAGT 


360 


TCTAATGAAT 


CTTCGACATT 


TNGANACGGT 


GTGATAATGT 


ATAAACCATN 


AAAATATTCA 


420 


TGAACAGTAT 


CGATTAAATC 


CTTTGAAAGC 


TTAAGACTTA 


GTTCTCGCGT 


TTTGGCTTTA 


480 


TCATCTCTAA 


C TGC TTC AAA 


TTGGTGGAAA 


ATTTCANCTG 


ACATCTTGGA 


TTCCNGGCAC 


540 








187 
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5 



10 



15 



TNCATTATGC 


AAAAAGAGTG 


CGTCNTTGTA 


ACTTGCGATA 


GGCATAATGC 


CTATGAAAAA 


600 


TGGNTTGNTC 


AAGTGCTTAG 


TGGCATGGTA 


AATTTCAATG 


ATTTTCTCTT 


TGCTGTACAC 


660 


GGGTTGTGTT 


ATNAAATAAG 


ACATTCCGCT 


TNCTATCTTT 


NTNCTCTAAT 


CTTNNGACGG 


720 


CACCATCTAA 


TTTACGAACA 


TCAGGGTTAA 


AGGCGCCAGC 


GATGTTGAAG 


TGTGTACGTT 


780 


TCTTCAGCGC 


ATCACCCCCA 


GNGTTAATAC 


CCTGATTAAA 


TCTTAGAGCG 


AGTTCANCNA 


84 0 


ATCCTTTAGG 


AATTAACATC 


ATAGACATTG 


GTTGCACCTG 


GTAAGTGACC 


AACTTTGGAA 


900 


GGATCACCAG 


TTATGGCTAA 


TATTTCGTTA 


ACGCCAATGA 


GCGATAATCC 


AAGTAAATGG 


960 


GACTGCAAGC 


CGATTAAGTT 


TCGGTCTCGA 


CATGTAATAT 


GTACGAGTGG 


NTCAATATTG 


1020 


TAATATTGCT 


TAATTAAGCT 


AGCAGCAGCA 


ATATTGCTAA 


TTCTGACAGT 


TGCCAATGAA 


1080 


TTATCTGCGA 


GTGTTACCGC 


ATCTACATTA 


GCTTTATCAA 


GTTTAGCGAT 


ATTTTCAAAA 


1140 


AATCTATCCG 


TGTCTAAATG 


TTTCGGTGTA 


TCCAATTCGA 


TAATAACGGT 


TGGACGTTCT 


1200 


TGAACC TTAG 


ATGTTAATGA 


TTGTCTAACT 


TTATTTTGAG 


ATGGATTGAA 


AAGTGCTTTC 


1260 


GTTGGTATCG 


GAATCACTTT 


TNTGNCANTA 


ACAGGTTTAA 


GTGTCNGAAT 


AGATTCTTTA 


1320 


ATAAATTTGA 


TGTGCTCTGG 


CGTTGTACCA 


CAGCAACCAC 


CAATTAAACG 


AACACCTTCG 


1380 


CGAATTAGTG 


CCATTNTGGN 


GNAACTTGAC 


CGAAATATTG 


T 




1421 



20 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1886 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 4: 

35 





TTATTTAAAA 


GTATCCAATA 


GCCAACAATC 


TCTTTCATCT 


TTGAGTAGTA 


TCCCACGTAT 


60 




TTCGGGATTG 


CATCCGAGTT 


GATTTTTATN 


ATTTTCTTCA 


ATGCTATCAA 


GCGCAGCTCG 


120 




TTTACGTCTT 


AC TTT AGGTT 


TTATTTGCTC 


AATTGCCTTG 


ATTGTTTGAT 


TTCTAACATC 


180 


40 


AGTAACAGCA 


GCATCTTGAT 


TTGTATTTTC 


TATTTCTTGT 


TGCGCTTGTT 


TGAGTGTGTC 


240 




GCTAATTAAT 


TGATTCGCTT 


CATCTAATTC 


ATCAACTGTT 


GCATGTGGTG 


TATCTTTTAT 


300 




TGATTCT AC T 


TGATTTTCTG 


CAGTTGCTTT 


TATTGCTTGT 


TGTGCTTCAG 


GCTTAATTAC 


360 




AATATGAGGT 


TGCACGAGCA 


CCTTTTAGTG 


TAGCAATGCC 


ATTNGTTTCA 


ACACGTTTCA 


420 


45 


CATCATTANT 


CGTGACTGCT 


TGATTGAGGT 


TTTGTAATGC 


AAGTTTTTCA 


TTATTCGCTA 


480 




ATTGATTTAA 


AGCAACTTGT 


TTTTCTTCAT 


CAGTCNCATG 


TTCAGCTTGC 


TCTATTTCTT 


540 




GCTTTTTAGC 


CTCATATTGT 


TGCTTTACTG 


CATCTCNAGC 


AGCTGCTCTA 


AAAATATGTT 


600 




CAGGCGCTAC 


TAAAGCAATG 


CTATCAAGCG 


CTTGACTTGT 


TGTATCATCA 


ACTTGTTGAT 


660 


50 


TTGTTCTATT 


ATTCGTAATA 


TCTGTCATGG 


CTTGATTTAC 


AAATTCNTAT 


GNTNTTATCT 


720 




AGTGCTACGT 


TGTCNNTCNG 


CCTGCTGTTG 


CNTCTTTATC 


CCTGANTAAT 


CGTAGCACAG 


780 




TAAATCATTC 


GCNTGTTGGA 


TTGAGTTTTT 


CCACGTTGCC 


AGCTGGNGGT 


AACTTTTTGT 


840 




TTCAGGGTGA 


ATAATNTTAA 


TCGCTGAT A C 


ACCATTTGTT 


CCTGCTTGAT 


TCACCTGACN 


900 


55 


ATTCGTTTGA 


GCNTGGTCAA 


TANCTCCAAG 


TGCTTTTTCT 


TTTTCTTTAG 


CTAATGCTTG 


960 
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5 



70 



15 



20 



TGAAGCAACT 


TCTTTCTCAT 


NATCTGTTGA 


ATCAAGACTA 


TTATCAATTT 


GCTGTTGC£T 


1020 


TTCTTTAACA 


GCTTTTTCAA 


NATCTGCAAT 


TGCCTTTGGT 


TTAATTACTA 


CTTCAGCTNC 


1080 


AACATTATCT 


ATAGCATTTA 


CCGCTTGATT 


TGTAGTTGTG 


TCTACCTGAT 


CATTTGTTTG 


1140 


GTTTTGATTA 


ATTTGATTAA 


TTGCTTGATC 


TTTAAGTTGA 


GTGATTTGAT 


TAACAGCAGC 


1200 


CTGCTTTTCT 


TCGTCAGTTG 


CATTAGGTGT 


TTGTTTAACC 


GCTTCAATAC 


GCTTCGCCAC 


1260 


TTCAGCAGTG 


ATTTTATCTC 


GCGCTGCTTG 


TTTTTTTACT 


ACATCAACTT 


GAACAGCATC 


1320 


GATATTATTC 


TCTGCT AC TG 


TCGCAGCTTG 


GTCTACTTCT 


GCATTTGTGT 


TAGCTTGTTT 


1380 


AATACTTCNA 


ATAGCTNGTT 


GTCTGTCTTG 


ATTTAAAGTA 


TAGATCGCAG 


CATTTNTCTC 


1440 


AGCATTCGTT 


GCATCTGGTG 


TAGCATTGAT 


TTCAGCTAAT 


TTAGCATTAT 


AATGCTGATT 


1500 


GATTTGTGCT 


AATGCTGCAG 


GTTTATTAAC 


AATATTTGGC 


TGAATCGCAT 


TAAT TGCTTT 


1560 


TGTACCTAAT 


TGTTGCGCTT 


GATCTACTTC 


CGC ATTTGT A 


TCAGCTTGAT 


TTATATTATT 


1620 


AATTGCCGTT 


GCTAACTCTT 


GATCCACTTG 


ATTTAAAGCC 


ACTTGCTTTT 


CTTCAGTTGT 


1680 


TGCATTTGTG 


TTTTGATTAA 


TTTCTTGCTT 


TTTAGCAGTT 


GCTAAATCAT 


TTAATACACC 


1740 


TGTAGCAGGT 


TGTTTCTTCG 


TTACATGCGG 


TTGACTGCGC 


CGATTTGATG 


AC TGC ATCGN 


1800 


CTCTAATACT 


ATTGACCACC 


GAGTAGTAGA 


CGTCACACAA 


TATCAGTAAT 


GCTCTATTTT 


1860 


AGTGATTGAA 


AC GAATATCG 


GTCTTG 








1886 



(2) INFORMATION FOR SEQ ID NO: 22 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 





GGCACGAGAC 


AATATATCTA 


CCGTGAATAA 


CTGTGCAAAT 


AGTGAAG TTG 


TAGCCGCCAT 


60 




ACGCATTTCA 


TTTTCATCAG 


TTCTGCCATA 


AATCAATGCA 


TAGTCGGCAA 


TTTGAGCCAC 


120 


40 


GTGC CGATTA 


TTCGGCACGA 


GCTGTACTAG 


ATATAGTTAT 


GATGGGAATA 


CTGTAATGTG 


180 




TGGCCACCTG 


TGCAATTGAC 


TGCAATTCAC 


TATGACTACC 


TTGATTCGTC 


ACAAAAATCA 


240 




TGCAATCTCT 


ATCATCATGC 


GTCGCAAATG 


TTGACACAAG 


TAAATGCGTT 


TCATGTAATA 


300 




ACCTGACATT 


TAAGCCAATA 


CGAGATAACT 


TTTGAAAAAG 


ATCACCAATA 


GTCAAACTCG 


360 


45 


ATGCGCCAAA 


TCCAAATAAA 


AATATTGTCC 


TGGCATTTTT 


CAACACATCA 


CAAATTGCAT 


420 




CAATTTGCGC 


ATCCATAATA 


TTAGTAGCTA 


CAAATCGCAT 


CCGTATTCCG 


TTGCTCTAGC 


480 




AATCATTTTA 


TTTTTCAAAG 


TTTCTACAGA 


TTCATTTTCA 


ATCAATTCTA 


AATGTGGATT 


540 




GGTTGCAATA 


TCTTCGGGTA 


AGTATCGAGA 


TATCGCAATC 


TTTAGCTCTT 


G AAAACC TTG 


600 


50 


ATGTGTCATT 


TTCCGACTAA 


ATCTAACAAT 


TGATGCTGTA 


CTAACATTCG 


TAACATCTGC 


660 




CAAATCATTC 


AC CGTC AT AT 


CAATGATTTT 


ATGTGGATTC 


TTTAAAATGT 


AATCAGCGAT 


720 




TATCTTTTCT 


GTCTTCGTAA 


AATCACTCAA 


CTGTTTATCA 


ATGCGATATT 


AAAATATTTG 


780 




TCATCATTAA 


TCACCCAACA 


AATCTGTCTT 


GTCGCATCGC 


CTTTGTCGGT 


CCAAATAAAT 


840 


55 


ATGTACAAAC 


GAATCCACCA 


GCATATGCAG 


CAAGTAATCC 


TGCAATATAA 


CCTAAATAC A 


900 
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TATTATCTGA 


GATTAATGGT 


AAGAGTGACA 


CACCACTTGG 


GCCTATTGCT 


TTGGCACCAA 


960 




TATGTCCAAT 


TCCACCTATT 


ACAGCGCCAC 


CAATACCACC 


ACCAATACAA 


GCAGTTAAGA 


1020 


5 


AAGGTCGAAC 


TAATGGGCAA 


AGTCACACCA 


TAGATTAATG 


GTTCTCCGAT 


ACCTAGGAAA 


1080 




CCAACTGGCA 


ATGCACCTTT 


TAAAGTATTA 


CGTAATGTTG 


TGTTGCGTTT 


ACATCTTACC 


1140 




CAAAGTGCTA 


ATGCGGCACC 


TAC TTGTCC A 


GCACCCAGCC 


ATCGCTGCAA 


TTGGCAATAA 


1200 




GTAAGTAGCA 


CCAGATTGGT 


TAATCAATTT 


CTATATGAAT 


TGGCGTNAAA 


AATATGATGA 


1260 


10 


AGGCCCTAAC 


AATAACTTAA 


CGGTAGGAAG 


CTTGGTCCAA 


ATGATAAATC 


CACTTAAATT 


1320 




ACGNCACCCA 


ATACTTAATA 


ATCCCCGTTA 


CTACTGAAAC 


TTAANTTGTC 


TTTGAAACAA 


1380 




A 












1381 



(2) INFORMATION FOR SEQ ID NO: 226 ! 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



30 



35 



40 



GGCACGAGCG 


TTTTAGTTGA 


TGTGTATCAC 


TAATATCTTT 


GAAAATTTTA 


ATCAGGTACT 


60 


ACGACAATAT 


GATGTCTGTT 


TTGTGTCTGA 


AAGTTTTACA 


GTTTTTAAAA 


TAAAAATGGT 


120 


ATAAAGTGTG 


ATTTGTATAA 


AAAAGAGTCT 


CGACGGATAA 


GAATTGATTA 


ATAACAGTTA 


180 


GCATTTTATT 


AATTACCTTA 


ACAATGATTC 


AAGTTT AG TT 


AAATGAGGTT 


TAATTTGAAA 


240 


GGGGATAGCG 


CCTCAATATA 


ATGTAGGTAG 


ATTGTTCATA 


TTACGTAATT 


GAAAAATCAA 


300 


ATTTAAATAN 


ATAGGGTGGG 


GCTNNAAATT 


ATGAAATTTA 


AAGCGATAGC 


AAAANCAAGT 


360 


TTAGCATTGG 


GAATGTTAGC 


AACAGGTGTA 


ATTACATCGA 


ATGTACAATC 


AGTACAAGCG 


420 


ANAACAGAAG 


TTANACAACA 


AAGTGAATCA 


GAG TTG AN AC 


ACTATTATAA 


TAAACCGGTT 


480 


TTAGAGCGTA 


AAAATGTTAC 


TGGATATANA 


TATACTGAAA 


AAGGTAAAGA 


TTATATAGAT 


540 


GTCATAGTAG 


ACAATCAATA 


TTCTCAAATT 


TCTTTAGTTC 


GGATCTGAAT 


AAAGACANAT 


600 


TTNAAGATGG 


AGACAACTCG 


NATATAGATG 


TGTTTATCCT 


TAGAGAAQGT 


GACAGTAGAC 


660 


AAGCAACNAA 


TTACTCAATT 


GGTGGCGTAA 


CACAAACAAA 


CAGTCAACCT 


TTTATTGACT 


720 


ATAG 












724 



45 

(2) INFORMATION FOR SEQ ID NO:227: 
(i) SEQUENCE CHARACTERISTICS: 

SO 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 



190 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 



10 



15 



CCTTCACCAA 


TACGTGTANT 


ATCNTTATTT 


GGGCGNTTGT 


CCACTTACCC 


ATGTAATCGT 


60 


GGCATCTGGA 


ATGGC AC T AC 


CATTAGATAA 


C TTAAAGT AA 


TCTGCTGCAC 


TAGAACCATT 


120 


ACCTGCTGGG 


AAGTCTTGGC 


CTTGTACAGT 


GTAATGCGAA 


TGTGCAACGA 


TTTCTGGAAT 


180 


GACATGCTTT 


ACAGGTACAG 


TCACTGTTGA 


TGTTGTTCCA 


TCTTGATGAG 


TAACAGTTAC 


240 


TGTGACATTG 


GCATTATTTC 


CAGTACGACT 


G AC ATT AC T A 


AC TGTTTT AC 


TAGTGATTTC 


300 


ATTTGCGCTT 


GCTGTTGCAT 


AG TTTCTATT 


TGGTGCATTA 


CTTGTAAACG 


TTAACGAATT 


360 


AATAATTGTC 


GCTTGATCAG 


C TTGTGAT AC 


TGTCGCATTA 


TTCGAAATAT 


TGGCAATTCT 


420 


AACAGGATTA 


GCAGNCCGTT 


GATGAAGTAC 


CAACTCGATA 


TTTATCACGC 


AAAGGTTTCA 


480 


CTGTTACATT 


GAATGAAGTT 


GTAGCTGTAT 


TACCACTNGT 


ATCAGTTGCT 


AATAAATTGA 


540 


TTGTCTTACT 


AGTTGCTGAT 


GTCACATTTG 


GTGCCGTTGC 


AGAAACATGT 


TGATGGTTAT 


600 


TATCAACAGT 


CTCGTGCC 










618 



(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 



40 



45 



50 



55 



TTGTGAGACA 


CACGCTTAGT 


GCACATGACA 


ATGTGAATGT 


TGTTGCACAA 


ATTATTAAAA 


60 


GAAGACTTAC 


CACTGTTAAG 


TTGTATCGAG 


CCGAATGATA 


CATTTGATTT 


TCCACNAAAC 


120 


TAGGGAGTGT 


AAGAAGTGAT 


GGAAAATAGT 


AGGCCCGAGC 


GTAATGAAGC 


GACGATGCAT 


180 


CTTGATGAAA 


TGACTGTGGA 


AGAGGCTTTA 


ATTACGATGA 


ATAAAGAAGA 


TCAGCAAGTC 


240 


CCGTTAGCAG 


TTCGAAAGGC 


AATACCACAA 


TTGACAAAAG 


TAATAAAAAA 


AACAATTGCA 


300 


CAGTATAAAA 


AGGGTGGACG 


ATTGATCTAT 


ATCGGTGCAG 


GTACAAGTGG 


AAGATTGGGT 


360 


GTCTTAGATG 


CAGC GGAGTG 


TGTACCTACA 


TTCAATACTG 


ACCCTCATGA 


AATTATAGGT 


420 


ATTATTGCTG 


GTGGACAACA 


TGCTATGACG 


ATGGCTGTAG 


AAGGTGC GG A 


AGATCACAAA 


480 


AAATTAGCGG 


AAGAAGATTT 


GAAAAATATA 


GATTTAACAT 


CAAAAGATGT 


CGTTATAGGA 


540 


ATTGCCGCGA 


GTGGCAAAAC 


GCCATATGTT 


ATAGGCGGTT 


TAACATTTGC 


TAACACAATC 


600 


GGTGCTACAA 


CAGTATCTAT 


TTCATGCAAT 


GAACATGCAG 


TTATAAGTGA 


AATTGCGCAG 


660 


TATCCAGTAG 


AAGTTAAAGT 


TGGTCCAGAA 


GTATTAACTG 


GTTCANCACG 


TTTAAAGTCT 


720 


GGTACAGCAC 


AAAAGTTAAT 


TTTAAATATG 


ATTTCAACCA 


TCACAATGGT 


TGGTGTCGGA 


780 


AAAGTTTACG 


ATAACCTCAT 


GATTGATGTT 


AAAGCAACCA 


ATCAAAAACT 


GATCGACCGT 


840 


TCAGTGCGTA 


TTATTCAAGA 


AATATGTGCT 


ATCACATATG 


ATGAAGCAAT 


GGCGTTATAT 


900 


CAGGTATCTG 


AGCATGATGT 


TGAAAG TTGC 


GACAGTTATG 


GGTATGTGTG 


GCATTTCTAA 


960 
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GGGAAGAAGC AACAAGACGG TTATTAAAAC AATGGTGACC TTGTT .... 1005 

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 



20 



25 



30 



35 



40 



GGCACGAGCG 


ATGACAGGAA 


TATGATATTG 


TTT TGCAGCT 


AACGCAACGC 


CTACGGGTGT 


60 


TTTACCAAAG 


ATGGTCTGAT 


AATCCATGCG 


TCCTTCTCCA 


GTAATAACGA 


GGTCTGCATC 


120 


TTTAATTCTT 


TGATGAAAAT 


CTGTAATGTC 


AAAGACGACA 


TCAATACCTT 


TTGTT AAAGT 


180 


TGTCTCACAA 


AACGCTAATA 


ATGCTGCGCC 


CATACCGCCA 


GCTGCACCAG 


AACCTGGTAT 


240 


TTGATTATAC 


GGACTTTCCT 


GTGCACATTT 


TTATCTTATC 


ATGACAATGN 


GACATTGCGA 


300 


AATCCAAATT 


AGGTATGCAT 


CTTTGCATCA 


GCGCATTTTT 


GAGGACCATA 


AATATAGGTA 


360 


GCACCATTTT 


CACCCAATAA 


AGGATTTGAA 


ACATCACAGG 


CCACTTTAAA 


GGTCACCTCT 


420 


TTTANTCGCG 


AATCGGCCAG 


ATTGGTTATA 


TCGATTTGTG 


CAATGTGAGC 


AAGATTAGCA 


460 


CCATTCATTT 


GTAATAAGTC 


CCCGTTTACA 


TCAGTAAACT 


TTACGCCTAG 


TGCACTTAGC 


540 


ATACCTGTAC 


CACCATCATT 


TGTTGCACTG 


CCACCAATCC 


CTAAAATAAT 


GGTCTTAGCA 


600 


CCATGATTTA 


ATGCATCTTT 


AATTAGTTCA 


CCGGTACCAT 


ATGATGATGT 


GTATAAAGGA 


660 


TTACGTTCCT 


CTTTTTCTAA 


TAAATCCAAA 


CCTGACGCTG 


CCGCCATTTC 


AATAATTGCA 


720 


ATTTGTTGTT 


CGTCTGCGCG 


TGCATAACAT 


GCTTCAATAG 


GTCGCATTAA 


AGGGTCATTA 


780 


ACGATGACTG 


TATACTTAGT 


GGCANCTGNT 


GCATGAATTA 


ANGCATCTGN 


GGTAC CTTCA 


840 


CCACCATCAG 


GCATCGGAAT 


GATATCATAA 


TGAAGGGTAT 


TCCCATAAAC 


ATTAGTAAAT 


900 


GCCTGTTTTA 


TAATATTGNC 


AACTTGGTGT 


GNGGNCATAC 


TTTCCTTAAA 


TGAGTCAGGA 


960 


GCGATGACAA 


TTTGTTGGGA 


ATGGATGGNA 


GNCACCNCTC 


AATATGNNGG 


CGNTCTTTTG 


1020 


GTTTTANATT 


NTAGTGACTN 










1040 



(2) INFORMATION FOR SEQ ID NO: 23 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
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5 



10 



15 



20 



25 



TGTATAGGCC 


TATTTGTATA 


NAAAGTAATA 


TGATG TTTAG 


TGGCTATGTA 


AGTGAAGCAA 


60 


TGTATAAATA 


ATGATGAGTG 


GTTTGTTACT 


AATGATAATG 


GCTATGTAAA 


AGAGC AG TAT 


120 


TTATATTTNG 


CGGGACGTCA 


ACAGGATATG 


TTAATTATTG 


GTGGGCGAAA 


TATATATCCA 


180 


GCACATGTTG 


TNCGCCTTTT 


AACGCAATCT 


TCGAGCATTG 


ATGAAGCAAT 


TATCATCGGT 


240 


ATTCCACATG 


AGCGTTTTGG 


TCANATAGGC 


GTATTGCTTT 


ATTCTGGTGA 


TGTGACACTT 


300 


ACACATAAAA 


ATGNGAAACA 


ATTTTTNAAA 


AAGAAAGTGA 


AACGCCATGA 


AATTCCATTC 


360 


GATGATTCAT 


CATGTAGAAA 


AGATGTATTA 


CNCTGCAAGT 


GGTAAAATTG 


CTAGAGAAAA 


420 


AATGATGTCG 


ATGTATTTGA 


GAGGTGAATT 


ATAATATGAA 


TCAAGCAGTC 


AT AGTTGC AG 


480 


CTAAACCGAA 


CTGCATTTGG 


GNAATATGGT 


GGC AC TTTAA 


AACATTTAGA 


GCCAGAACAA 


540 


TTGCTTAAAC 


CTTTATTCCA 


ACATTTTAAA 


GAGAAGTATC 


CAGAGGTAAT 


ATCTAAAATA 


600 


GATGATGTAG 


TTTTAGGTAA 


TGTTGTTGGG 


ANTGGTGGCA 


ATATTGCAAG 


AAAAGC ATTG 


660 


CTTGAAGCGG 


GGCTTAAAGA 


TTCAATACCT 


GGCGTCACAA 


TCGATCGGCA 


ATGTGGGTCT 


720 


GGAC TTNGAA 


AGTGTTCAAT 


ATGCATGTCG 


CATGATCCAA 


GCCGGAGCTG 


GCAAGGTATA 


780 


TNTTGCAGGT 


GGTGTTTGAN 


AGTACAAGTC 


GAGCACCTTG 


GAAAATCAAA 


CGACCGCANT 


B40 


CTGTGTACGA 


AACAGCATTA 


CCNGAGTTTT 


TATGAGCGTG 


CATCATTTGC 


ACCTGAAATG 


900 


AGCGACCCAT 


CACATGATTC 


AAGGTGCTGA 


AAATGTGGCC 


AAGATGTATG 


ATGTTTCAAG 


960 


AGAATTACAA 


GATGAATTTG 


CTTATCGAAG 


TCATCAACTG 


ACAGCGGAAA 


ATGTAAAGAA 


1020 


TGGAAATATT 


TCTCAGGAAA 


TATTACCTAT 


AACCGTTAAA 


GGAGAAATAT 


TCAATACTGA 


1080 


TGAAAGTCTA 


AAATCACATA 


TTCCGAAAGA 


TAACTTTGGC 


CGATTTAAGC 


CCGTAATCAA 


1140 


AGGTGGGACC 


GTTACCGCTG 


CGAATAGTTG 


TATGAAAAAT 


GATGGTGCAG 


TTTTATTGCT 


1200 


TATTATGGAA 


AAAGATATGG 


CATACGAATT 


AGGTTTCGAG 


CATGGTTTAT 


TATTTAAAGA 


1260 


ATGGTGTTAC 


GGTAGGTGTT 


GATTCTAATT 


T 






1291 



30 

(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

45 





CATTCATCTA 


ACCTNTTCAT 


CAGTTGAGTG 


ATTTTAATCT 


ATCCTTCAAA 


AGTTNGNGGA 


60 




TCAGGTCCAA 


TTCTCTTATC 


TAGATTTAAA 


CCAGGCACGA 


GCGTCATTTG 


TTCATCTGAT 


120 




AATTCGAAAT 


CAAATATTTG 


GAAGTTTTCA 


GAGATTCTGT 


TTGGTGTTAC 


CGATTTAGGG 


180 


50 


ATTATAACCA 


CACCATGCTG 


CACATTCCAT 


CTTAAAACAA 


CTTGGGCAGG 


TGACTTTCCT 


240 




AATTC TTG AG 


CAATGTCTTT 


AATTGTCTCA 


TCATTTAAAA 


TTTGTGCATT 


CATCAATGGT 


300 




GACCAAGATT 


CCATCACGAT 


ATGTTGTGCT 


GCCAAATATA 


ATTTCAATTT 


ATGTTGCGTT 


360 




AAATATGGAT 


GATATTCAAC 


TTGATTAATT 


ACAGG TTTAA 


TTG AC AC TTG 


TGCCAACAAA 


420 


55 


GCTTCCAAAT 


GTTCAGGTTC 


AAAAATTGCT 


GACACCTATA 


TTTTTAGCTT 


TATTATTTTT 


480 
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20 



25 



SO 



ATATAAATCT TCCATACCTT TC C ATGTT AT CAACCATTAC GGCTTCGTTC GTGCCTGGCC 54 0 

AATGTTACTA RRDCTNS 557 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



GGCACAATTC 


GGCACGAGCG 


TTGGTATGTA 


GAATATGATG 


GTGAATGGTA 


TGTGTATAAC 


60 


GATGGGAGGC 


TTGAATAAAT 


GAACTGGAAA 


CTGACAAAGA 


CACTTTTCAT 


TTTCGTGTTT 


120 


ATTCTTGTCA 


ACATCGGGTT 


AGTATCGATT 


TATGTTAATA 


AAGTCAATCG 


CTCACACATT 


180 


AATGAAGTTG 


AGAGTAACAA 


TGAAGTTAAT 


TTTCAGCAAG 


AAGAAATTAA 


NGNACCGGCT 


240 


AGTATATTGA 


ATAAATCAGG 


TAAAGGTATA 


AAATGAGAGC 


AAATTACAGG 


GCGATCAAAA 


300 


GACTTTAGTT 


CTAAAGCTAA 


GGGCGATTCG 


GATTTGACCA 


CATCAGATGG 


TGGAAAATTA 


360 


TTGAATGCGA 


ACATTAGTCA 


ATCGGTAAAG 


GTCAGTGACA 


ATAACTTAAA 


AGATTTGAAA 


420 


GATTATGTTA 


ACAAACGTGT 


GTTCAAAGGT 


TCAGAATATC 


AATTAAGTGA 


AATTAATTCT 


480 


GGTTCTGTAA 


AATACGAACA 


AACGTATGAT 


AATTTCC CGA 


TTTTGAACAA 


TAGTAAAGCG 


540 


ATGTTGAACT 


TTAATATAGA 


AGATAACAAA 


GCGGCTAGTT 


ATAAACAATC 


AATGATGGAT 


600 


GACATTAAGC 


CCACAGATGG 


TGCAGATAAG 


AAGCATCAAG 


TTATTGGAGT 


GAGAAAAGCA 


660 


ATCGAGGCAT 


TATAGTATAA 


TCGTTACTTG 


AAAAAAGGTG 


ATGAAGTCAT 


TAATGCTAGA 


720 


CTCGGTTACT 


ACTCAGTCGT 


GAACGAAACG 


AATGTTCAAT 


TGTTACAACC 


AAACTGGGAA 


780 


ATTAAAGTGA 


AGCATGACGG 


TAAGGACAAA 


ACGAATACTT 


ACTATGTCGA 


AGCGACAAAT 


840 


AATAACCCTA 


AAATTATTAA 


TCATTAATAT 


GAATCGTAAT 


AAGCTAGTAT 


TGCAAGCTCG 


900 


TGCC 












904 



(2) INFORMATION FOR SEQ ID NO:233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 3: 
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5 



10 



15 



TTGTAC C AC C 


ACAGTTGTCA 


CGGAGACCAT 


AAAGATGCCG 


AGGAGTAAAA 


GTAAAAAGAA 


60 


CCAAATAGAA 


TTG ATACP A.<"3 


CRGCTGATAT 


GTTAAACTTG 


TTAGGGTTTG 


AATTACAACC 


120 


AACTAATGAT 


GGATTGATTA 


TTCATCCGTC 


AAGAATTTAA 


AACAAATGCA 


ACAGTTGATA 


180 


GTTTAACTGA 


TCATCGAATA 


GGAATGATGC 


TTGCAGTTGC 


TTCTCTACTT 


TCAAGCGAGC 


240 


CTGTCAAAAT 


CAAACAATTT 


GATGCTGTAA 


ATGTATCATT 


TCCAGGATTT 


TTACCAAAAC 


300 


TAAAGCTTTT 


AGAAAATGAG 


GGATAATATA 


AAATGGAAGA 


TATCTATAAA 


TTAATAGACG 


360 


ATATCAATCT 


ACAAAAACTA 


GAAAATTTAG 


ACTCTCGTGT 


TAATGAAGCA 


ATAACTACTG 


420 


ACAACGATGA 


CGCATTATTT 


ATTCTAGGAG 


AGACACTTTA 


CAATTTTGGA 


TTAATGC C AC 


480 


AAGGTTTGGA 


AGTATTCCGC 


TCGTGCCATA 


TCNCAAATAT 


CCAGNCGANA 


GTGANGTGCT 


540 


GATTTATTTT 


ATTGAAGGTT 


TAATGTCTGA 


NAATCAACCT 


GCCGAAGCGT 


TAGAANANTT 


600 


AAGTTATGTT 


GATCCATCAC 


CTGCNNAGTT 


GATGTTNAAG 


AAATAGTTTT 


GCGANNTGAT 


660 


TATCTAAAAC 


AGTATTCAGA 


ANTTATTTGG 


TAGAAAATA 






699 



(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 


(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 234: 








CCAAACAACC 


TATCCTTCAA 


ACAATTATCC 


AATATTTGTT 


GAACATGGTG 


CAATTGACCA 


60 




TATTAGCACG 


TATATTGATC 


AGTTTGATCA 


AAGTTTTATA 


TTAATTGACG 


AGCATGTAAA 


120 


35 


TCAATATTNT 


GCTGAAAANT 


TTGATTGAAA 


TTTTAATCAA 


ATTGAAAATG 


TCCATAAAGT 


180 




TCATTAATCC 


NAACCTGGTG 


AAAAAGACGG 


AAAACATTTG 


ACCAATATCA 


AAGAAACATT 


240 




AGAATACATT 


NTGGTCACAT 


CATGTAACGC 


GTAATACAGC 


GATTATAGCT 


GTTGGTGGTG 


300 




GTGCGACAGG 


AGATTTTGCA 


GGATTTGTAG 


CAGCAACACT 


ATTAAGAGGT 


GTCCATTTTA 


360 


40 


TACAAGTTCC 


TACAACGATT 


TTGGCGCATG 


ATTCTAGTGT 


TGGCGGTAAA 


G TGGGT ATT A 


420 




ACTCAAAACA 


AGGTAAAAAC 


CTTATCGGTG 


CATTTTATCG 


TCCAACTGCT 


GTGATTTATG 


480 




ATTTAGACTT 


TTTAAAGACG 


TTACCATTTG 


AGCAAATATT 


AAGTGGC TAT 


GCAGAAGTTT 


540 




ATAAGCATGC 


GTTATTGAAT 


GGTGAATCAA 


CGACGCAAGA 


AATCGAACAG 


CACTTTAAAG 


600 


45 


ATAGAGAGAT 


ATTACAGCCA 


TTAAATGGTA 


TTGGATAAAT 


ATATTGCTAA 


AGGT 


654 



(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 880 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 



10 



15 



20 



25 



CTAACTTTCG 


CTTTTTCTTT 


TTCTTCAATG 


TCTTGCACTT 


GTCTTTCAAC 


AATGTTAAAT 


60 


CTTTGTTGTA 


AACCTGCTTT 


TTCTGTTTCT 


AGTTGTTTGA 


TGTCTTCCAT 


ATCAATATTT 


120 


GGATCTGTTG 


CTTTCTGACT 


CAATTCATCA 


TTTTTATTTT 


TTAATTGNTG 


TCCAATCATA 


180 


CCTAAGGATT 


GTTTNAATTC 


ATATAATGTC 


GGCATNTCAT 


TTCCTCCTAA 


TAANTCATTG 


240 


TCATTTTTAA 


AATTTCGCAT 


TCGAGCGTAC 


AATNTNNTCT 


CTNTNNTCTT 


NCTCTTCTAG 


300 


CGACATACTT 


TCTTTAGGTG 


GTTTCAACCA 


ATCCAGATCG 


TATCTNACAT 


CATCAATTTT 


360 


AGTGATTTTN 


TCTACATCTT 


TCTTTAAATC 


TTCTGGGACN 


TTCTCNAAAC GCCTACATTG 


420 


CTCTTTAGAG 


ATACTAGCAG 


CTATTTCATT 


AGCTCCTAAA 


ATTTCATCTA 


TCAAGCCGAA 


480 


AGACAAGGCT 


TCTTCTGCAG 


TAAGCCAAGT 


TTCTGCATCT 


AACATCTGTT 


TTAAGTGTTC 


540 


TTGATCTAAA 


TCTNTTGNTT 


TATCTAAATA 


AGCTGAATTA 


CTAACAGCAT 


CTGTTTTTTC 


600 


AAGTAAATCC 




TTAATTCTTC 


TGCATTACCT 


ACAGTCATAA 


CCCATGAATT 


660 


ATGAATCATT 


AAAAAACTAT 


TNNTGGGGCA 


TAAAAATAGT 


GTCACCACTC 


ATAGGGATAA 


720 


CAACTAGCAA 


TTGGATGCCG 


CTAAGGCATC 


GACATAGATA 


TTANTTNNTG 


GAGGATGCAT 


780 


TTNTAGCATA 


TTGGNTTTTT 


GGATGNTCCN 


CAAATACANT 


GGCTCCAGAT 


GAATTTATAT 


840 


TGAACAACTT 


TTTCAACTGA 


TGTCTCCNNG 


GTCAACTTAG 






880 



(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 


(Xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:236: 








AGTATTTTTT 


TGACCGAGAC 


ATGCAAGCCC 


TCCTGGTTGA 


TTATCATTTA 


CATATCGTAT 


60 




CATAACCAAT 


CATAAACAAT 


AGTTTTATTA 


ACTTGTAGAC 


ACAATGTTTG 


CTAAAGTCAT 


120 


45 


TTTTAAAAAA 


TATAGCCAAA 


AAATTAGCTA 


TATATTATAA 


AAGCGTGATA 


TAAATGTTTT 


180 




ATATAACAAA 


GAAATAAAAA 


TCATTTTTTA 


CAAATGGTTG 


TAAGAAAAAG 


ACATGCAGAT 


240 




GTTGTTAAAA 


TTTTAATAAG 


AATCAAGGAG 


GC TAT ATT AT 


ATGGCTAAAC 


TAAATGTAGA 


300 




AGTATTTGCG 


GACGGTGCAG 


ATATTGAAGA 


AATGAAAGCA 


GCTTATAAAA 


ACAAACAAGT 


360 


50 


GGATGGTTTT 


ACAACAAATC 


CT AG TTTAAT 


GCCGAAAGCG 


GCCGTAAGCA GATTACAAAG 


420 




CTTTTGCTGA 


AGAAGCTCGT 


GAAAGAAATT 


CC AGATGC TT 


CAATTTCATT 


TGAAGTATTT 


480 




GCAGACGATT 


TAGAAACTAT 


GGAAAAAGAA 


GCAGCAATTT 


TAAAACAATA 


TGGCGAAAAT 


540 




GTATTTGTTA 


AAATTCCTAT 


TGTAAATACA 


AAAGGTGAAT 


CAACGATTCC 


TTTAATTAAA 


600 


55 


AAACTTTCAG 


CTGACAATGT 


GAGATTAAAC 


GTTNCGGCTG 


TTTACACAAT 


TGAACAAGGT 


660 
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AAAGAAATAA CTGAAGCAGT AACTGAAGGT GTGCCCAACA TATGTTTCAG TATTTGCAQG 720 
ACGTATTGCA GATACAGGCG TAGATCCATT ACCATTAATG AAAGAGGCTG TAAAAGTTAC 780 
GC A TAG T AAA G AC GGCGTTA AATTATTATG GGCAAGTTGC CGCTCGTGCC 830 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 



25 



30 



35 



NTTAATTTTA 


AAATGTACAA 


TGATATNTGT 


GAAAGCGCTT 


GCTTAGGAGG 


TGTATNTGAG 


60 


AGTGAATGAA 


ATGAATGCTA 


AAGAACNATT 


AGTGGACNAT 


TTAATGAAAA 


CATCATCGCA 


120 


ATTATTTAAA 


TTNCACGGTG 


AAGTTGNCAT 


GCAGCTTNTC 


TTAAATGATG 


AATTAAAATT 


180 


ACCTTCTATT 


GNTGAAATAT 


GCGTGGAACG 


TAAGCGTTTA 


AGTGATATTG 


TGAAAGTTAT 


240 


TCCGCAATCA 


TATGCGTTAC 


TATACATAGA 


TAAGCAAGAT 


CAAGCAAGAG 


CTAAAGANNA 


300 


TTTATCACTT 


NCAAAAATTG 


CAAAAGTTTA 


TGTGCAATAT 


GATGATACAA 


CAATAATGAG 


360 


TATTTTCGTT 


TATGATGTAG 


TAAACGATGA 


ATGGATTTTA 


AGATTGGATC 


CGAATATACG 


420 


TATACCTAAG 


AGTAACATAT 


ACTTCCATAG 


TTTAAATTGG 


GATGTGGATT 


ATATTAAACC 


480 


GGAGATCGTT 


CTAATGTATG 


ATCTAATGCA 


ACACCATCAG 


TATCATCATT 


ATTCCAATTA 


540 


TAAACGAGTC 


ATAGATGCNT 


T AAGC TACT A 


TCAATTTTTT 


ATTTTAAAAT 


TTGTAGTAGG 


600 


TGAGCNACGT 


ATTAAGGATG 


CAATCCAGAA 


GAACAATAAA 


TAATTAAGAA 


AAAGCAATTC 


660 


ATAACGCAGT 


TGAATACATG 


TGTTACGAAT 


TGCTTTTATA 


TTAGTTTTTA 


TCACACAAGT 


720 


TTTTTAATGC 


AACNCCGTGA 


TAGCAAAACT 


CATATGTAGA 


TAATACAGCT 


TTTTCAGCAT 


780 


CATCTACAT 












789 



(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

ATATTCGAGA CGACCCTAAN TAATTGTGTT GTCTGTCATA CTGGGNCANG ATTTGTGTGN 60 



197 
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10 



CTANATACNA 


CTNATCATTA 


GC ANTAGATT 


GTTTATATGT 


TGTTGAGTGG 


CAAAATTGTT 


120 


GTGGTGCGTG 


TGGGGTTGCN 


TGCTGAGATG 


AGTGAGTATA 


TTGTTGAGTA 


TTATATGGCT 


180 


CTGGTGTTGG 


GTGTTGCGCC 


TGTGGCACGT 


TAGNCTCGAG 


CCGCTGGGTG 


TGTATTATCA 


240 


TCAGTTTTCT 


TCTGAGTATC 


ATCTGAGNTA 


TCTTGAGATG 


CATTGTCATC 


CTTATCTTTC 


300 


GACTCATCCT NTGATGCTTT 


ATCATCATTC 


TCNTCTTTAG 


CAGGACGTTG 


CTTTGTAGGT 


360 


GCTTGTTGCT 


GAGGTGGCAC 


ATATTGATAT 


TGATTCTGCT 


GTGGCAATTG 


CTGATATTGG 


420 


TTTGCTTGTG 


AATCTAGCTC 


AGCCTGCTTT 


TTTTCCTCTT 


GTTGCTTTNN 


CTTTC TCTCT 


480 


TTATCTGCAA 


TTTCTTTTTG 


ACGCTTTTCT 


TTTTGTTCCT 


GCTTTTTGTT 


CGTTCAACAT 


540 


ACGTTCTTTA 


GCTTTATTCG 


AATGAATCTA 


CATATGCAAA 


AATTGCAAAT 


ACTAAATCCT 


600 


CCCGA 












605 



15 

(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 



35 



40 



GGCACGAGAC 


TGGTGGAGGT 


TCAGGTAACT 


CGCCGGCACG 


ATTAATGCCT 


GATAAAATAT 


60 


TGGATTTAAA 


GTATAAGTTG 


CGTGTTAATA 


ATGTACCGAC 


ACCAAGAACA 


GTAACATTTA 


120 


ATGATACATT 


AACATATAAA 


ACATATACAC 


AAGATTTTAT 


TAATTCACCT 


GCTGAAAGTC 


180 


ATACTGTAAG 


TACAAATCCA 


TATACAATCG 


ATATCATCAT 


GAATAAAGAT 


GCATTAGAAG 


240 


CCGAGGTTGA 


TAGACGCATG 


GCAACAAGCT 


GATTATACAT 


TTGCATCATT 


AGATATCTTT 


300 


AATGATCTTA 


AAAGACGTGC 


ACNAACGATT 


TTAGATGAAA 


ATCGTAACAA 


TGTACCATTA 


360 


AATAAAAGAG 


TTTCTCAAGC 


ANATATTGAT 


TCATTAACTA 


ATCAAATGCA 


ACATACGTTA 


420 


ATTCGAAGTG 


TTGATGCTGA 


AAATGCNGTT 


AATAAAAAAG 


TTGACACAAA 


TGGAAGATTT 


480 


AGTTAATCAA 


AATGATGANT 


TGACAGATGA 


AGAAAAACAA 


GCNGCAATAC 


AAGTTATCGA 


540 


GGAACATAAA 


AATGAAATAA 


TTGGTAATAT 


TGGTGACCAA 


ACGACTGATG 


ATGGCGTTAC 


600 


TAGAATCACA AGATCAAGGT 


ATACAGACCT 


TAAGTGGGGA 


TACTGCAACA 


CCGGTTGTTA 


660 


AACCAAATGC 


TAGGAAAAGC 


AAT 








683 



45 

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(.ii) MOLECULE TYPE: Genomic DNA 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 



10 



15 



20 



25 



TGATGATTAG 


ATAAATTGAA 


ACAACAACGT 


ATTGAGTTGA 


ATGAACAAAT 


CGATGCGCAG 


60 


GAAGCTACAT 


ACAAGTTTGT 


CACCAAGATA 


TTTTAGCTAT 


CGAAAATrAC 




120 


TTAAAGCTGA 


ACAATCAAAG 


TTAGGATGTA 


TTAATTCATC 


ATGCGATAGA 


TCATTTAAAT 


180 


GATGAATATC 


AATTGACTGT 


TGAACGTGCG 


AAATCTGAAT 


ATACGAGTGA 


TGAATCGATT 


240 


GACGCATTAC 


GTAAAAAAGT 


TAAGTTAATG 


AAGATGTCCG 


ATTGATGAAC 


TAGGTCCTGT 


300 


AAACTTAAAT 


GCAATTGAAC 


AATTTGAAGA 


GTTAAATGAA 


CGTTATACAT 


TTTTAAGTGA 


360 


ACAACGTACA 


GATCTTCGTA 


AAGC TAAAGA 


AACATTAGAG 


CAAATTATAA 


GTGAAATGGA 


420 


TCAAGAGGTT 


ACTGAAAGAT 


TTAAAGAAAC 


TTTCCATGCT 


ATTCAAGGAC 


ATTTTACAGC 


480 


TCGTGCCCAA 


ACAATTGTTT 


GGTGGAGGCG 


AT GC AG AA TT 


GCAATTAACT 


GAAGCCGATT 


540 


ATTTAACAGC 


TGGTATTGAT 


ATTGTGGTAC 


AACCACCGGG 


TAAAAAGTTG 


CAACATTTAT 


600 


CGTTACTGAG 


TGGTGGTGAG 


CGTGCATTAA 


CTGCTATTGC 


TTTAC TATTT 


GCAATTTTAA 


660 


AAGTAAGATC 


TGCACCTTTT 


GTTATATTAG 


ATGAGGTTGA 


AGCTGCACTA 


GATGAAGCAA 


720 


ATGTTATTAG 


ATACGCAAAA 


TATTTAAATG 


AGTTATCAGA 


CGAAACACAA 


TTCATTGTTA 


780 


TTACACACCG 


TAAAGGAACA 


ATGGAATTTG 


CAGATAGGTT 


ATACGGTGTA 


ACAATGCAAG 


840 


AATCAGGTGT 


TACTAAACTT 


GTGAGTGTGA 


ATTTAAATAC 


AATAGATGAT 


GTGTTGAAGG 


900 


AGGAGCAATA 


ATGAGCGGCA 


CGAGCTCGTG 


CC 






932 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 241: 








CGGTTAATCG 


GTAGTATTTT 


AGGTTCNGTT 


GTTATTAAGA 


TTTGGGAAGA 


TTTGATCTTC 


60 




CATCCCCCAA 


AGGGNTTAAC 


CGTATGGGGA 


TNCNTCANAA 


TCATTTGCCC 


AAGCTAAACC 


120 


45 


ACCAAAGAAT 


GGATATTTGA 


ATCCGGTTGC 


ATTTAAGNTC 


AATTGGGATG 


ATTTTTAGCC 


180 




TCCAGTTATT 


AAGTTCAAGT 


GTTAAATACC 


CTGTTCTAAT 


TTTAATTAAT 


TCTATTAATT 


240 




TAAGATACTT 


AAAATGATAA 


GACACTAAGA 


AAGGGAGGCT 


ATTAGTAATA 


ATGCCCAAAA 


300 




ATAAAAGCAA 


ATG AAGC ATT 


AGTTAAAGCA 


TTACAAGCAT 


GGGATATAGA 


TCACTTGTAT 


360 


SO 


GGTATTCCAG 


GAGACTCAAT 


CGACGCAGTT 


GTCGATTAGT 


TTAACGTACA 


GTGAGAGATC 


420 




AATTTAAATT 


TTATCATGTA 


CGTCATGAAG 


AAGTAGCAAG 


CTTAGCGGCT 


GCTGGTTACA 


480 




CAAAATTAAC 


TGGTNAAATC 


GGTGTGGCAT 


TAAGTATCGG 


TGNCCCTGGT 


TTAATTCATT 


540 




TATTAAATGG 


TATGTACGAT 


GCCAAAATGG 


ATAATGTACT 


CGTGCCAATT 


AATATTATCT 


600 


55 


GGACAAACNG 


AATAGTACAG 


CACTTGGAAC 


GAAAGCATTT 


CAAGAAACAA 


ATTTACAAAA 


660 
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ATTATGTGAA GATGTAGCCG TTTATAATCA CCAAATTGAA AAAGGTGACA ATGTGTTTG A 720 

AATCGTTAAC GAAGCAATTC GTACGGCATA TGAACAAAAA GGTGTCGCTG TTCTTATT7G 780 

5 TCCTAACGAC TTATTAACTG AAAAAATTAA AGATACAACG AATAAACCAG TAGATACATC 840 

AAGACCAACA GTTGTATCAC CAAAATATAA AGACATCAAA AAAGCGGTTA AACTAATTAA 900 

TAAAAGTAAA AAGCCTGTCA TGTTAATTGG TGTAGGTGCA AAACATGCGA AAGATGAGCT 960 

CGTGCC 96 6 

10 

(2) INFORMATION FOR SEQ ID NO: 242: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 



30 



35 



40 



45 



50 



CTGTGAAATT 


AAGTCGNTAT 


TACCCGCAAG 


GATTAAGAAG 


TTTGAATGGT 


GGTCGCATGG 


60 


CAAGATTTGG 


ACGTACACCA 


TTAC TTGATG 


CAATGGAGAT 


GGCTAATGAG 


CATATTATGG 


120 


TGATTGCCAT 


GATAGAAGAT 


GTTGANGGGG 


TTATGGCCAT 


TGACGATATA 


GCTCAAGTCG 


180 


AAGGTTTAGA 


CATGATAGTC 


GAAGGTGCCG 


CAGGATTTAT 


CGCAGTCACT 


TGGCATACCA 


240 


TNGCAAACGA 


GCGTGATGAT 


CAAGTAACAT 


CACATNTTCA 


ACATATTTNT 


GNGGTTGTGA 


300 


ATGCACATGG 


TAAACATTTN 


TGTGCATTAC 


CACGTGAAGA 


TGAAGATATT 


GCAAAATGGC 


360 


AGGCACAAGG 


TGTACAAACA 


TTTATTTTAG 


GCACGAGCGG 


AAAAATATAT 


CGCCATTTAA 


420 


GTGCATCTCT 


AGCGACGTCT 


AAACAGAAAG 


GGGATGATGG 


CTAATGCGTA 


TAGTTCAACC 


480 


TGTTATTGAA 


CAATTAAAAG 


CACAATCTCA 


TCCAGTTTGT 


CATTATATCT 


ATGATTTAGT 


540 


CGGACTGGAA 


CATCATTTGC 


AACATATTAC 


ATCGTCATTG 


CCGAGTAATT 


GTCAAATGTA 


600 


CTATGCAATG 


AAAGCAAATA 


GTGAACGAAC 


AATCCTAGAT 


ACAATTAGTC 


AGTATGTTGA 


660 


AGGATTCGAA 


GTTGCATCTC 


AAGGTGAAAT 


AGCAAAAGGT 


CTTGCTTTTA 


AACCAGCAAA 


720 


TCATATTATT 


TTTGGTGGCC 


CTGGTAAGAC 


AGACGAGGAA 


CTAAGATATG 


CAGTAAGTGA 


780 


AGGTGTTC AG 


CGTATTCATG 


TTGAAAGTAT 


GCATGAATTA 


CAACGGCTAA 


ATGCCATCTT 


840 


AGAAGATGAA 


GATAAGACAC 


AACACATTTT 


ATTGCGTGTT 


AATTTAGCAG 


GACCATTTCC 


900 


CAATGCAACG 


TTGCATATGG 


CAGGACGCCC 


AACACAATTT 


GGTATTTCTG 


AAGACGAAGT 


960 


TGATGATGTC 


ATTGAAGCTG 


CGCTAGTAAT 


GCCAAATATT 


CATCTAGATG 


GCTTTCATCT 


1020 


TCATTCCTAT 


TTCTAACAAT 


TTAGACTCGA 


ATTTACATGT 


CGATGTAGGG 


GAAACTTTAT 


1080 


TTTAAAAAAG 


CAAAATCATG 


GCCTTGAAAA 


ACATCGATTT 


CCACTCAAAC 


AT ATCAATC T 


1140 


TGGTGGGTGG 


CATAGGCGGT 


CAACTATGCA 


GATTTAACTA 


GGCCAACTNG 


AGTGGGATAA 


1200 


TTTTGNNGAA 


AATTTNAAAA 


C AC TTATCGT 


TGAGCAAGAA 


ATGGAAGATG 


TGACATTGAA 


1260 


CTTTGAATGT 


GGGCGCTNTA 


TTGTGGCACA 


TTGGGGTTAC 


TATGTGACAG 


AAGTGCTAGA 


1320 


TATTANGAAA 


GNGCATGGCG 


CTTGGTATGC 


CATTTNAAGA 


GGAGGTACGC 


AACAACNTAG 


1380 


CCTGCCGGNA 


TCTTGC 










1396 
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15 



25 
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. (2) INFORMATION FOR SEQ ID NO:243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3918 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 



AC C AAAGCC A 


AATACTTGCA 


TTAANNCGGG 


GTAATTCCAA 


CAACTTAAGA 


ACNGTGCGAA 


60 


ACNAGCCTGC 


GGAACNACAA 


GCGATTACCA 


AATGCTTCGG 


GACNTTAAAA 


TACCACAACA 


120 


CAAAGAAGCA 


TTGTAAGCAC 


AAGTAACAAG 


TGCAGGACGC 


GTATCTGCAG 


CANATGGTGT 


180 


TGAACATACT 


GCGACTGAAT 


TAAATACTGC 


GATGACAGCT 


TTAAAACGTG 


CCATTGCTGA 


240 


TAAAGCTGAC 


ACACAAGCTA 


GTGGTAATTA 


TGTCAATGCT 


GATGCGAATA 


AACGCCAAGC 


300 


ATATGATGAA 


AAAGTGACAG 


CTGCAGAACA 


TATCGTTAGT 


GGTACACCAA 


CACCAACGTT 


360 


AACNCCATCA 


GATGTTACAA 


ATGCAGCAAC 


GCAAGTAACG 


AATGCGAAGA 


CGCAGTTANN 


420 


CGGTAATCAT 


AATTTAGAAG 


TAGCGACNCA 


AAATGCTAAC 


AC AGC AATTG 


ATGGTTTAAC 


480 


TTC TTTAAAT 


GGTCCTCCAA 


AAACCCAAAA 


CTTAAAGAAC 


AAGTGGGTCA 


AGCGACNGAC 


540 


GTTNCCAAAT 


GTTCAAACTG 


TTCGTGATAA 


TGCACAAACA 


TTAAACACTG 


CAATGAAAGG 


600 


TCTACGAGAT 


AGCATTGCGA 


ATGAAGCAAC 


GATTAAAGCA 


GGTCAAAACT 


ACACAGATGC 


660 


AAGTCAAAAC 


AAACAAAATG 


ACTACAACAA 


TGCAGTCACT 


GCAGCANAAG 


CAATCATTGG 


720 


TCAAACAACT 


AGTCCATCAA 


TGATTGCGCA 


AGAAATTAAT 


CAAGCGAAAG 


ACCAAGTGAC 


780 


AGCTAAACAA 


CAAGCGTTAA 


ACGGTCAAAG 


AAAACTTAAG 


AACTNCGCAA 


ACCAAATGCG 


840 


AAGCAACAAT 


TGAATGGCTT 


AAGTGACTTA 


ACTTAATGCC 


CCCCAAAGAT 


GNAGCGANAC 


900 


CNCCAAATCG 


AGGTGCAACG 


CATGTTAATG 


GAAGTAAACA 


CCCAAGCCCA 


CAATAATGGG 


960 


GACGGCATTA 


AAATACAAGC 


TATGNCGGNA 


CTTGTNAAAT 


GGTAATTCAA 


AGACTCACAA 


1020 


TNCGGATTAA 


GCAAGGTGTT 


AACTTCACTT 


GATGCAGATG 


AAGCGAAACG 


TAATGCATAT 


1080 


ACAAATGCAG 


TGACGCAAGC 


TGANCAAATT 


TTAAATAAAG 


CACAAGGGCC 


AAATACTGCA 


1140 


AAAGACGGTG 


TCGAAACTGC 


GTTACAAAAT 


GTACAACGTG 


CTAAAAACGA 


ATTGAGCGGT 


1200 


AATCAAAATG 


TTGCGAACGC 


TAAGACAACT 


GCGAAAAATG 


CATTGAATAA 


CCTTACATCA 


1260 


ATTAATAATG 


CACACAAAGC 


AGCATTGAAA 


TCACAAATTG 


AAGGTGCGAC 


AAC AG TTGC A 


1320 


GGTGTAAATC 


AAGTGTCTAC 


AATGGCATCT 


TGAATTAAAT 


ACCTGCAATT 


GAGCAACTTA 


1380 


CCAACGTGGT 


ATTAATGACG 


AAGCAGCTAC 


AAAAGCAGCT 


CAGAAATATA 


CTGAAGCAGA 


1440 


TAGAGATAAA 


CCCANCCTGC 


ATACAATGAT 


GCTGTAACAG 


CAGCTAAAAC 


GT T ATTAG AT 


1500 


AAAACAGCTG 


GTTCAAATGA 


CAATAAAGTA 


GCCGTTGAAC 


AAGCATTACA 


ACGTGTGAAT 


1560 


ACTGCTAAAA 


CAGCATTAAA 


TGGTGACGCG 


CGATTAAATG 


AAGCGAAGAA 


CACAGCTAAA 


1620 


CAACAATTAG 


CGACAATGTC 


ACATTTAACT 


AATGCTCAAA 


AAGCAAACTT 


AACAGAACAA 


1680 


ATTGAACGTG 


GTACAACTGT 


TGCTGGTGTT 


CAAGGCATCC 


AAGCAAATGC 


TGGTACTTTA 


1740 


AATCAAGCAA 


TGAATCAATT 


AAGACAAAGT 


ATTGCTTCTA 


AAGATGCGAC 


TAAATCAAGC 


1800 


GAAGATTATC 


AAGACGCGAA 


TGCAGATTTA 


CAAAATGCAT 


ACAATGATGC 


GGTAACTAAT 


1860 


GCTGGAGGTA 


TTATTAGTGC 


ANCGAATAAC 


C CTGAAATG A 


ATCCTGATAC 


AATTANCCAA 


1920 



201 



EP 0 841 394 A2 



10 



15 



AAAGCGAGCC AAGTGAACAG TGCGAAGTCT GC ATTGANCG GTGATGAAAA ATTAGCAGCA 1980 

GCAAAACAAA CTGCGAAATC AGATATCGGT CGTGTGACAG ACTTGAACAA TGCACAACGA 2040 

AC TGCGNC AA ATGCTGAAGT GGATCAAGCA CCAANTCTTG CAGCTGTCAC AGCGGCTAAA 2100 

AATAAAGCAA CATCGTTAAA CACAGCGATG GGTAATGTGA AACATGCACT TGCTGAAAAG 2160 

GATAATACGA NACGTAGTGT CAATTACACA GATGC GGATC AACCAANACA ACAAGCGTNT 2220 

GATACTGCAG GTACACAAGC AGAAGCAATT ACTAATGCAA ATGGCAGTNA CGCGAATGAA 2280 

ACACAAGTTC AAGCAGCGCT TAACCAATTG AATCAAGCTA AAAACGACTT GGAATGGGTG 2340 

ATAATAAAGT TGCTCAAGCG AAAGAAACAA CAAAACGTGC ATTAGCTTCA TATAGTAACT 2400 

TGAATAACGC GCAATCAACT GCAGCAACTA GTCAAATTGA CAATGCAACG ACAGTAGCAG 2460 

ACGTAACTGC TGC AC AAAAT ACTGCTAATG AATTAAATAC AGCAATGGGT C AACT TC AAA 2520 

ATGGTATTAA TGACCAAAAC ACTGTTAAAC AACAAGTGAA CTTTACAGAT GCTGACCAAG 2580 

GTAAGAAAGA TGC TT AC AC A AATGC TGTT A CGAATGCTCA AGGTATTTTA GATAAAGCAA 2 64 0 

CACGGTCCAA AATATGNCAA AAGCACAAGT TGAAGCTGCA TTAAATCNAG TANCGNCTGC 2700 

TAAGAATGCT TTAAACGGTG ATGCAAATGT NAGACAANCA AAATCAGATG CG AAA GC AAA 2760 

CTTAGGTACA TTAACAC AC T TANATAATGC ACAAAAACAA GATTTAACAT CACAAAGCGN 2820 

NGGTGCAACA ACAGNCAACG GTGTAAATNG TGTTAAAACG AAAGCACNAG ACTTAGATGG 2880 

TGC AATGC AA CGATTAGAGT CAGCAATCGC AAATAAAGAC CAAACTAAAG CGAGCGAAAA 2940 

CTACATTGAC GCAGATCCAA CTAAGAAAAC AGCATTTGAT AATGCCATCA CACAAGCTGA 3000 

ATCTTACTTA AATAAAGATC ATGGTACGAA TAAAGATAAG CAAGCTGTTG AACAAGCAAT 3060 

TCAAAGTGTA ACGTCTACTG AAAATGCTTT GAACGGTGAC GCGAACTTAC AACGCGCTAA 3120 

AACTGAAGCT ACACAAGCTA TCGATAACTT GACACAATTG AATACACCGC AAAAAACAGC 3180 

ATTGAAACAA CAAGTGAATG CTGCACAACG CGTATCAGGT GTAACTGATC TGAAAAATAG 3 240 

TGCTACATCA CTTAATAATG CGATGGATCA ATTAAAACAA GCAATTGGTG ATCATGACAC 3300 

AATTGTAGCT GGTGGTAATT ACACTAACGC AAGTCCTGAT AAACAAGGTG CTTACACTGA 3360 

TGCATATAAT GCTGCGAAGA ATATCGTAAA TGGTTCACCT AATGTGATTA CAAATGCAGC 3420 

AGATGTTACT GCGNCAACAC AACGTGTCAA T AATGC TGAA ACAAGTTTAA ATGGTGAGAC 3480 

AAACTTAGCA ACTGGCGAAG CAACAAGCTA AAGATGCATT ACGTCAAATG ACACATTTAT 3540 

CTGATGCACA ANAAACAAAG TATTACTGGT CAAATTGATA GC GCG AC AC A AGTAACTGGT 3 500 

GTACAAAGTG TGAAAGACAA TGCAACAANT C TTGAC AATG CAATGAATCA ACTTCGAAAT 3 660 

AGTATTGCGA ATAAAGATGA AGTAAAAGCG AGTCAACCAT ATGTTGATGC AGATACAGAT 3720 

AAACAAAATG CATACAATAC AGCAGTTACA AGTGCTGAAA ATATCATTAA TGCAACGAGT 3780 

CAGCCAACAC TTGATCCATC TGC AG T AAC A CAAGCAGCTA ATCAAGTGAA C AC T AAC AAA 3 840 

ACTGCGC TT A ATGGTGCGCA AAACTTAGCA NATAAAAAGC AAGAAACAAC TGCTAACATC 3 900 

ACCCGATTAA GTCATTTA 3 918 

45 (2) INFORMATION FOR SEQ ID NO: 244: 

(1} SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 65 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

5 



70 



15 



20 



25 



30 



ATGCGATACG 


CTT AAC CAGC 


AATGCATGGT 


TTAAGACAGA 


GCATTCAAGA 


T AACGC AGC A 


60 


NCTAAACCGA 


ATAGCAAATA 


TATCAACGGA 


AGTCCACGAG 


AGCAACCAAA 


ACTATGATCA 


120 


AGCTGTTCAA 


GCCCCAAATA 


ATATTATCAA 


TGAACAAACT 


GC AACATTAG 


ATAATAATGC 


180 


GATTAATCAA 


GTAGCGGCAA 


CTGTGAATAC 


AACGAAAGCA 


GCATTACATG 


GTGATGTGAA 


240 


ATTACAAAAT 


GATAAAGATC 


ATGCTAAACA 


AACGGTTAGC 


CAATTAGCAC 


ATCTAAACAA 


300 


TGCACAAAAA 


CATATGGAAG 


ATACGTTAAT 


TG AT AG TG AA 


ACAACTAGAA 


CAGCAGTTAA 


360 


GC AAGATTTG 


ACTGAAGTAC 


AAGCATTAGA 


TCAACTTATG 


GATGCATTAC 


AAC AAAG TAT 


420 


TGCTGACAAA 


GATGCAACAC 


GTGCGAGCAG 


TGCATATGTC 


AATGCAGAAC 


CGAATAAAAA 


480 






1 1 Ln/UiA l ot 








^ A ft 

J Si U 


AACTATCAAT 


AAAGGTAATG 


TATCAAGTGC 


GACTCAAGCA 


GTAATATCAT 


CTAAAAATGC 


600 


ATTAGATGGT 


GTTGAACGAT 


TAGCTCAAGA 


TAAGCAAACT 


GCTGGAAATT 


CTCTAAATCA 


660 


TTTAGATCAA 


TTAACACCAG 


CTCAACAACA 


AGCGCTAGAA 


AATCAAATTA 


ATAATGCAAC 


720 


AACTCGTGAT 


AAAGTGGCTG 


AAATCATTGC 


ACAAGCGCAA 


GCATTAAATG 


AAGCGATGAA 


780 


AGCATTAAAA 


NAAAGTATTA 


AGGATCAACC 


ACAAACTGAA 


GCAAGTAGTA 


AATTTATTAA 


840 


CGAGGATCAA 


GCGCAAAAAG 


ATGCATATAC 


GCAAGCAGTA 


CAACACGCTC 


GAAGATGCCT 


900 


TGNTTAACAA 


AACAACTGAT 


CCTACATTAG 


CTAAATCAAT 


CATTGATCAA 


GCGACACAGG 


960 


CAGTGACTGA 


TGCTAAAAAC 


AATTTACATG 


GTGGATCAAA 


AACTAGCTCA 


AGATNAGCAA 


1020 


CGTGCAACAG 


AAACGTTAAA 


TAACTTGTCT 


NACTTGAATA 


CACCACAACG 


TCAAGCACTT 


1080 


GANAATCAAC 


TCAATTCNTG 


CAGCAACTCG 


NCGNGANGTA 


GCACANAAAT 


TTACTGANGC 


1140 


ACANGCACNT 


TATCCACGCA 


ATGGANGCTT 


TNCGTCATAG 


CATTC AC GNN 


NCAACAAACA 


1200 


AAACAGATCT 


GGTNTGCAAN 


TTTTTTATTG 


AAGGTTNNNC 


CNCACCCCGG 


TGCTTCCNGC 


1260 


GCCAC 












1265 



(2) INFORMATION FOR SEQ ID NO: 2 45: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Genomic DNA 

45 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 





TTCATTAGTG 


GTGTGGCACG 


ATAGCTCAAN 


GATAACCAAA 


CTCCTGGAAA 


TTCTCTAAAT 


60 


50 


TCATTTAGAT 


CAATTAACAC 


CAGCTCAACA 


ACAAGCGCTA 


GAAAATCAAA 


TTAATAATGC 


120 




AACAACTCGT 


GATAAAGTGG 


CTGAAATCAT 


TGCACAAGCG 


CAAGCATTAA 


ATGAAGCGAT 


180 




GAAAGCATTA 


AAAGAAAGTA 


TTAAGGATCA 


ACCACAAACT 


GAAGCAAGTA 


GTAAATTTAT 


240 




TAACGAGGAT 


CAAGCGCAAA 


AAGATGCATA 


TACGCAAGCA 


GTACAACACG 


AGCGAAAGAT 


300 


55 


TTGATTAACA 


AAACAACTGA 


TC C T AC ATT A 


GCTAAATCAA 


TCATTGATCA 


AGCGACACAG 


360 
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5 



10 



GCAGTGACTG 


ATGCTAAAAA 


CAATTTACAT 


GGTGATCAAA 


AACTAGCTCA AGATAAGCAA . 


420 


CGTGC AACAG 


AAACGTTAAA 


TAACTTGTCT 


AACTTGAATA 


CACCACAACG 


TCAAGCACTT 


480 


GAAAATCAAA 


TCAATAATGC 


AGCAACTCGT 


GGTGAAGTAG 


CACAAAAATT 


AAC TGAAGC A 


540 


CAAGCACTTA 


ACCAAGCAAT 


GGAAGCTTTA 


CGTAATAGCA 


TTCAAGATCA 


ACAACAAACA 


600 


GAATCTGGTA 


GCAAGTTTAT 


TAATGAAGAT 


AAACCGCAAA 


AAGATGCTTA 


CCANGCAGCA 


660 


GTTCAANATG 


CAAAAGATTT 


AATTAACCAA 


ACAGGTANTC 


CAACGCTTGA 


TAAAGCACAA 


720 


GTTGAACAAT 


TGACACATGC 


TTTTAAACAA 


GCTAAAGATA 


ACCTACACGG 


TGATCAAAAA 


780 


CTTGCAGACG 


A 










791 



(2) INFORMATION FOR SEQ ID NO: 246: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

20 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 



30 



35 



40 



45 



50 



55 



CTATTAAGGA 


ATCCTTGNNA 


AATTCACATT 


AGCAAGTTGG 


ATTGTCCACC 


TTATTAATTG 


60 


ATAAAGTATG 


CAAATAATTT 


CACAGTGTTT 


CCAGCGGGGC 


CAANTTATGA 


TTGCAAGTAA 


120 


CCTATAGCAA 


TCGTATTCTT 


GTTCTTGCCA 


ACGCTATTTA 


AGTATCAGGT 


TTAACCAACA 


180 


GGTGCGACAA 


AAGGTTAGTT 


TGAAATTAGG 


GAGTGGGGCA 


GAATTGATAA 


AGAAC C ACTA 


240 


ATGACGATAA 


AGATTAAAAG 


GGAGGACGTT 


ATGATGACGG 


ATTAAAGTTG 


GAATCATTGG 


300 


GTGTGGTGGT 


ATTGCGAATG 


GCAAGCAACA 


TGCCAAGTTT 


ACNAAAAGTT 


GAAAATGTTG 


360 


AAATGATCGC 


ATTTTGTGAC 


GTAGACATTT 


CGAAAGCAGC 


GAGTGCGGCA 


GAAGCATACG 


420 


GAACTGACAA 


TGCAAAGGTT 


TATGATGATT 


ACAAAGCATT 


GTTAAAAGAT 


GACACGATTG 


480 


ATGTTATCCA 


TGTTTGTACG 


CCAAATGACT 


CGCATTGTGA 


AATT AC TGT A 


GCAGGGTTGC 


540 


ATGCCGGTAA 


GCATGTGATG 


TGTGAAAAAC 


CAATGGCTAA 


AACGACAGCA 


GAAGCTCAAA 


600 


AAATGATAGA 


TACAGCTAAA 


TCAACAGGTA 


AAAAATTAAC 


AATAGGTTAT 


CAAAATCGCT 


660 


TCAGACCAGA 


TAGTCAATTT 


TTACATCAAG 


CAGCGCAACG 


TGGCGACTTA 


GGAGACATTT 


720 


ACTTCGGAAA 


GGCACATGCC 


ATTCGTCGTC 


GTGCAGTACC 


GACATGGGGT 


GTCTTTCTAA 


780 


ACGAAGAAGC 


CCAAGGTGGA 


GGACCTTTAA 


TCGATATTGG 


TACGCACGCA 


TTAGATTTAA 


840 


CGTTATGGAT 


GATGGATAAT 


TATGAACCAG 


AATCAGTGAT 


GGGTTCAACA 


TTCCATAAAT 


900 


TAAATAAGCA 


GCATGATGCG 


CCAAACGCTT 


GGGGTTCATG 


GAATCCAGAT 


GAATTAACAG 


960 


TTGAAGACTC 


TGCGTTTGGC 


TTTATTAAAT 


GAAGAACGGA 


GCGACGATCA 


TTTTAGATCC 


1020 


GCTTGGGCGA 


TTAATTC TTT 


AGAAGTGGAT 


GAGGCAAAAT 


GTTCATTATT 


AGGAACGAAA 


1080 


GCAGGTGCTG 


ATATGAAAGA 


TGTTCTACGT 


ATTCATGGTG 


AAGATATGGG 


CACACTTTAT 


1140 


ACCAAACACG 


TTGAATTNGA 


AAACAAAGGC 


GTCGAC TTTT 


ATGAAGGTAA 


TGAAGTGGAT 


1200 


GAAGCTGAAG 


AAGAANCAAA 


AGCTTGGATT 


GATGCAGTTG 


TAAATGATAC 


TGAACCAGTT 


1260 


GTGAAACCGG 


AACAAGCAAT 


GGTAGTTACA 


AAAATTCTTG 


AAGCGATTTA 


TCAGTCTGCA 


1320 


AAATCAGGCA 


AAGCAATTTA 


CTTTGAATAA 


CATCATACGG 


TAAGGAGGCA 


CATCATGACA 


1380 
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20 



30 



40 



45 



SO 



AAATTAAAA{1 

AAA X X / *-^Vr^rl ^ 


l 1 I\3 lun 1 


AGGTGTTGGT 
x vj x x wv x 


GGCATTGCAC 


AAGACCG^CA 

AAOAV V V-J * \_ r\ 


TATTrPAflTA 


1440 


TTGCTGAAAC 


TCAAAGACAC 


AGTCTCATTA 


GTTGCAGTAC 

VJ X X VJT V*AU X *V V— 


AAGATATTAA 


TACAGTGGAG 

X A\. X wV- ^VVJ 


1500 


ATGATTGATG 


TTGCGAAGCG 


C TTT AAT AT A 

XXX X ** X ** 


CCTCAGGCAG 


TTGAG AC AC C 


TAGCGAGCTG 

1 Av\_UAVJL 1 W 


1560 


TTTAAAC TTG 


TTGATGCGGT 


GGTCATTTGT 

0\J X XXX w x 


AC AC CC AAT A 


AAA * U^A 1 


TG ATCTTTCT 

X NJ<V X W X X X V*» x 


1620 


ATAGAAGCAT 


TGAACCATGG 


TGTCCATGTC 


TTATGCGAAA 


AGCCAATGNC 


AA^GACGACG 


1680 


G JLAG AGTGTG 


ATPGCATYSAT 


TGAAGCGGCT 


AATAAAAATG 


AC A AA T'TATT 
nvnnn x X r\ X X 


AACTGTCGCT 


1740 


TATCATTATC 


GTCACAGAGA 


TGTCGCTATG 

X X V_\J\- X <** X V? 


ACTGCTAAAA 


AAGC AA TTG A 


AGGAGGTGTG 


1800 


GTTGGTAAAC 


CATTAGTAGC 


ACCGTTGTCC 


AANCGATGCG 


TAGGCGTAAA 


GNAPT'TGGGT 

UiNM^V X VjVj\J X 


1860 


GGGGCG TT TT 


TACCCAATAA 

X AWv\-AA X 


AACCGTTGCA 

AAV>- WX X 


AGGTGGCGGT 


AGT^TAA TCG 

«*w X X X r\r\ X x- Vj 


ATTATGGTTT 

f\ X X s \ X VJO 1 X X 


192 0 


CGCAfTTGTT 




TTTG TGGGT A 

X X X \J X VJVJU Xrt 


TTAGGTAAAG 


ATATGGTGPP 


GC ATGA AGTG 


1980 


CTAGGAAAAA 


CATATAATCA 


ATTGAGCAAA 


CAAGCGAATC 


AAA^TAATGA 

AAA « X X VJA 


TTGGGG A A C A 


2040 


TTTGATCACA 


CTAAATTTGA 

v^. X /W\ X J. X 


TGTCGATGAT 


CATGTTACTA 


GT TAT ATG AC 

VJ X X x\ X X yAv 


ATTTGCGAAT 


2100 


CGAGCAAGCA 


TGCAGTTTGA 


ATGTTCGTGG 


TCTGCAAATA 


TAAAGGAAGA 


TAAGGTACAC 


2160 


GTTAGTTTAT 


CAGGAGAAGA 


TGGCGGTATC 


AATTTATTTC 


CATTTGAAAT 


ATATGAGCCC 


2220 


CGCTTTGGAA 


C T ATTTTTGA 


AAGCAAAGCT 


AATGTTGAGC 


ATAACGAAGA 


C ATTGC TGGT 


2280 


GAGAGACAGG 


CGCGTAACTT 


TGTCAATGCG 


TGTTTAGGTA 


TAGAAGAGAT 


TGTGGTGAAA 


2340 


CCGGAAGAAG 


CACGCAATGT 


AAATGC C CTT 


ATAGAAGCGA 


TTTATCGTAG 


CGATCTTGAT 


2400 


AACAAGAGCA 


TACAACTTTA 


ATGATTATTA 


TATATAATAC 


AAAATTCTCA 


ATATAAAAAG 


2460 


ANGGAGTGCT 


TTTCAATGAA 


AATAGGTGTA 


TTTTCAGTAT 


TATTTTACGA 


TAAAAATTTT 


2520 


GAAGATATGT 


TAGATTATGT 


CGCAGAATCT 


GGATTGGATA 


TGATTGAAGT 


TGGAACAGGT 


2580 


GGTAACCCAG 


GAGATAAATT 


TTGTAAGTTA 


GATGAGTTGT 


TAGAAAATGA 


AGACAAGCGA 


2640 


CAAGCATTTA 


TGAAGTCAAT 


CACAGACAGA 


GGC TT AC AAA 


TAAGTGGTTT 


CAGTTGTCAT 


2700 


iin^nri X v-\_/-i-f-S. 


x x i\, lVLftUA 








vj 1 1 ALU X ftAft 


£ > D U 


ACAATCCGTT 


TAGCAAATCT 


ATTAGACGTG 


CCAGTTGTTA 


ATACATTTTC 


TGNCATTGCA 


2820 


GGGTCAGATG 


ATACCGCTAA 


AAAGCCTAAT 


TGGCCTGTTA 


CACCTTGGCC 


AACAGCCTAC 


2880 


TCTGAAATTT 


ATG ATT ATC A 


GTGGAATGAA 


AAGTTGATAC 


CATATTGGCA 


AGATTTAGCT 


2940 


GAGTTTCCNC 


AAGAGCAAGA 


TGTNNAAATT 


GCCATNGAGT 


TACATNCAGG 


ATTTTTAGCT 


3000 


CANACACCAA 


ATACGATGTT 


NAAGTTACGT 


GAGCCAACAA 


ATGAATATAT 


CGGTGC TAAC 


3060 


TTAGATCCTA 


GTCATTTATG 


GTGNCAAGGT 


ATTGACCCAA 


NTCCTGCGAN 


TCGCATATTA 


3120 


GGCCCNANCA 


AATNCAATTC 


ATCACTTCCA 


TGGAATTCCG 


AAGAAACGTA 


TGTTANTCNA 


3180 


GGGNATGTAA 


ANATGTATGG 


TCTAGC TGAT 


NTCCAGCCAT 


ATGGTNACGT 


TGCGACANGN 


3240 


GCATGGACAT 


TCCNTACAGN 


TGGNTATGGA 


CATAGTCCAT 


ATGNATGGGC 


AGATNTCATA 


3300 


AGTCAACTTA 


NTATTAGATG 


GAG 








3323 



(2) INFORMATION FOR SEQ ID NO: 2 47: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 641 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xj.) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 



5 


TCCATTTAAT 


TCCACCTTGC 


CCCGACAGGT 


TTTNCCGACC 


TGGAAAGCGG 


TCAGTGAGCC 


60 




CNAACNCAAT 


TAATGTGAGT 


TAGCTCCACT 


TCATTAGGCC 


ACCCCAGGCC 


TTTACACTTT 


120 




ATGCTTCCGG 


CTCCGTATGT 


TGTGTGG AAT 


TTGTGAGCGG 


ATAACAATTT 


CACACAGGAA 


180 




ACAGCTATGN 


C C ATGATTAC 


NCCAAGCTCG 


AAATTAACCC 


TCACTAAAGG 


GAACACAAGC 


240 


10 


TGGAGCTCCC 


ACCGCGGTGG 


CGGCCGCTCT 


AGAACTAGTG 


GATCCCCCGG 


GCTGCAGGAA 


300 




TTCGACGAGA 


ACGCGTGCAG 


CGTTTACAGT 


TGCGTCTATT 


GATTTAGGTG 


CGCATCCAGA 


360 




ATTTTTAGGG 


AAAAATGATA 


TTCAATTAGN 


CAAAAAAGAA 


TCTGTAGAGG 


ATACTNCNAA 


420 




AG T ATTAGGT 


AGAATGTTCG 


ATGGTATTGA 


ATTCAAACTT 


TAACTGAGCT 


TCATGGGGCA 


480 


15 


CCAGGTTTTG 


AAGAAGAAGT 


AAAAAATTAT 


ATGACTCAGC 


AAA PGGCGCC 


GTATGTAGAT 


540 




GGAATTTATT 


GAAAATCGTA 


TGGGTGGATT 


TTTTGGGTGT 


GAAAAAATCT 


AAAAATCCAA 


600 




ATGC AAAACG 


TGTAATGATT 


GCAGNACATA 


TGGATGAAAT 


C 




641 



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:248: 



35 



40 



45 



50 



55 



C TGC TT AAAT 


TCGAACGTGT 


CATTGACATT 


TTCAATAAGA 


AAAACAACGG 


AGCATGGAAT 


60 


TTCTGTCCGC 


AAAACGCCGG 


TTATTGGGAA 


CATATCCCTA 


AGAGTATTAC 


AAAATTATCA 


120 


GATTTAAAAA 


TCGTTGGTTT 


AGATTTCTAT 


ATCACTACTG 


AAGAATCAAA 


AC G ATT T AC T 


180 


GATTTTCCTA 


AAGACTTTAA 


AGGTATTGCA 


GGTTGGATAT 


TAGAAGTAAA 


ATCGAATACA 


240 


CCAGGTAACA 


CAACACAAGT 


ATTAAGACGT 


AATAACTTCC 


CGTCTGCACA 


TCAATTTNTA 


300 


GTTAGAAACT 


TTGGCACGAG 


ACTGGTGGCG 


TTGGTAAATG 


GGAGTTTATT 


CGAAGGAAAG 


360 


GTGGTTGAAT 


AATGATAGTA 


GATAATTTTT 


CAAAAGACGA 


TAACTTAATC 


GAGTTACAAA 


420 


CAACATCACA 


ATATAATCCA 


ATTATTGACA 


CAAACATCAG 


TTTCTATGAA 


TCAGATAGAG 


480 


GAACTGGTGT 


TTTAAATTTT 


GCAGTAACTA 


AGAATAACAG 


ACCGTTATCT 


ATAAGTTCTG 


540 


AACATGTTAA 


GACATCTATC 


GTGTAAAAAC 


CCGATGATTA 


TAACGTAGAT 


AGAGGCGCTT 


600 


ATATTACAGA 


CGAATTAACG 


ATAGTAGACG 


CAATTAATGG 


GCGTTTGCAG 


TATGTGATAC 


660 


CGAATGAATT 


TTTAAAACAT 


TCAGGCAAGG 


TGCATGCTCA 


GGCATTCTTT 


ACACAAAACG 


720 


GGAGTAATAA 


TGTTGTTGTT 


GAACGTCAAT 


TTAGCTTCAA 


TATTGAAAAT 


GATTTAGTTA 


780 


GTGGGTNTGA 


TGGTATAACA 


AAGCTTGTTT 


ATATCAAATC 


TATTCAAGAT 


AC TATCGAAG 


840 


CTGTCGGTAA 


AGACTTTAAC 


CAATTAAAGC 


AAAATATGGC 


TGATACACAA 


ACGTTAATAG 


900 


CAAAAGTGAA 


TGATAGTGCG 


ACAAAAGGCA 


TTCAACAAAT 


CGAAATCAAG 


CAAAACGAAG 


960 


CTATACAAGC 


TATTACTGCG 


ACGCAAACTA 


GTGCAACACA 


AGCTGTTACA 


GCTGAAGTCG 


1020 


GATAAANTAG 


GTGGGAAAAA 


GAGCAAGCGA 


TTGGGGACCG 


TTTTAACGAA 


GTNGGACAAC 


1080 
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AAATCAATGC GCC TGGCCTT GTNAAGGTAA TNCAACAACA AATTGGGAAA 113 0 

f (2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 



20 



25 



30 



35 



40 



CCTCAGCATA 


TGGTTGAACC 


CCTTCTTGAT 


AACCCATATC 


TGTTAGTTTT 


CCCCAATGAT 


60 


TGCTCGCATA 


GCTAATTGTG 


CCATTGGAAT 


ATCAAGTAAT 


TTTACTTAGG 


CACGAGAATG 


120 


GTNCCGTTCT 


ACTAGAACGG 


TGGGTTTACT 


TCTAAAACAA 


TATACACCAT 


CGTGAGCTAT 


180 


AACGAATTGA 


ATGTTGATTA 


AGCCAATGAT 


GTTTAAACCT 


TTAGCTAATT 


TTATAGTATA 


240 


GTCCTCAAGT 


GTTGC TAACT 


CGTCTTCTGT 


CAAAGTTTGT 


GGCGGATATA 


CAGCGATTGA 


300 


GTC AC C ACTA 


TGTACACCAG 


CACGTTCAAT 


ATGTTCCATG 


ATTCCTGGAA 


TAATGAC CGT 


360 


TTCTCCATCA 


CAAATCGCAT 


CAACTTCAAT 


TTCTTTACCA 


GTTAAATATC 


TATCGACTAG 


420 


TACCGGATGT 


TCCGGACTCG 


CTTTTACAGC 


CTGGGTCATA 


TAGTTTTC T A 


ACTCTTTGTC 


480 


ATTGTC T AC A 


ATTTCCATTG 


CGCGACCACC 


TAATACATAA 


GAAGGTCTTA 


CTACAACCGG 


540 


ATATCCGATT 


TCTGCAGCAT 


TCGCTAATGC 


TTCCTCAGGT 


GATGTAGCTG 


ATTTCCCTTG 


600 


TGGCTGTGGC 


ACGTTAATTT 


TTCTTAATAG 


TGCTTCAAAT 


TCTTTTCTAT 


CTTCAGCACG 


660 


ATTTAGATTT 


TCTAGTGAAG 


TACCAAGTAT 


TTTAACACCA 


TGTTTAGCCA 


ATTTGTC TGC 


720 


TAAATTAATC 


GCTGTTTGTC 


CTCCAAATTG 


TACAACGACA 


CCTTTAGGTT 


TTTCTAAATT 


780 


AATGATATTC 


ATC AC ATC TT 


CTTCAGTTAA 


AGGTTCAAAG 


TATAATTTGT 


CAGAAATTGA 


840 


AGAAGTCTGT 


TGAAACTGTT 


TCTGGATTGT 


TATTCACAAT 


TATCGCTTCG 


TACCCTGCTT 


900 


TTTGAATTGC 


CCAAACGGCG 


TGAACTGTCG 


CATAGTCAAA 


TTCTACACCT 


TGGCCGATTC 


960 


GAATTGGTCC 


AGAGCCTAAT 


ACTAAGATTT 


CTC TTTGTC A 


GTAACTATGG 


ATTCATTTTC 


1020 


AGTTTCGTAT 


GTACCATAAT 


AATATGGTGT 


TG AG ATC AAA 


TCAGCTGCGC 


AGTATCAACA 


1080 


TCTGTAACAG 


GTTAATATCA 


TTTC 








1104 



(2) INFORMATION FOR SEQ ID NO: 2 50: 

45 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1995 base pairs 
(3) TYPE: nucleic acid 

50 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 50: 

CTATCATAAA AACATAAAGT ATTGTAAGCT TTTTATCGAT ATTTTTTATT TATAAAAATA 6 0 

AAATGAGATA ACTNTGTGAA TTTTTATTGA GATAAATTAG AT AGTGNTG T TTTTGTGATG 120 

TTTAATAATA TCTTGGGTGT GTTAATACTA GTAATGCTTT CAACTGATGC ATTAAGACTG 180 

TGACATCATA ACTCATTTAA GAACTTCGCT TATTAATTTT CTACCAATAC AATCCCTTCT 24 0 

AAGTGCACTA AAAAATCCTT ACTGC TAAGT GATTAAACTT AACAATAAGG ATTTATTTAT 3 00 

CGC TAATGCA TGATTATTAA CGGAATCTCA TACCACCATC TACAATAATT GTTTGTCCAG 360 

TAATGTAATC AG AG TCTTTA CCAGC TAAGA AGCTCACTAC ATTTGAAACA TCTTCTGGTT 42 0 

GAGAAACTCT GCCCAAAGCA ATCTGACTTG TAAATTGTTC CCAACCCCAT GCTTCAGGTT 480 

15 TACCTGCTTC TTCGGC TGTT GCC AC TGCGA TACTTTCCAT CATTGGTGTT TGAACGATAC 540 

CAGGTGCGAA TGCATTCACA GTAATACCTT CAGACGCTAA ATCTTGTGCG GCTACTTGTG 600 

TTAAACCTCG CACTGCGAAT TTTG T AC TGC AATATAAAGA CAAGCCTGGG TTACCCTCAA 660 

CGCCTGCTTG AGATGTTGCA TTGATAATTT TACCGCCATG ATTGAATTTT TTAAATTGTT 720 

2Q CATGTGCGGC TTGAATACCC CATAGCACAC CTGCAACGTT C ACGC CAT AT ACTGTTTTAA 780 

ACTGTTCTTC AGTAATTGTA TCGATTGGTG TTGTTGGTCC AAGGCCGGCA TTGTTAACCA 840 

TGACATGGAA ATCGCCAAAT TGCGCTCGGA GTTGCCTGTC TTACTGCGTT AAATACATCA 900 

TCACGGTTTG ATACATCTGC TTTGATAGCA ATAGCTTTTG TACCATCACT TGATAATTTA 960 

25 AGTGCAGCTG CTTTTGCCCC TTCTTCATTG AAATCAACAA CTGCTACTTT GAAACCATCT 1020 

TCCACTAAAC GTTCTGCAAT TTTAAAACCA ATCCC TTGTG CTCCGCCAGT TACTAATGCT 1080 

ACTTTGTTGT TTGTCATAAA GATCACTCCT C AAATTTC TT TCCTTTAATT ACATTTTACT 114 0 

CCTCTTCATT TGAATAGTAC AACAAAGGTA GCTCCATTTA ACAAAATATT CAGATATTTA 1200 

AGGTATAGTT AAACGCACTA CCATTAGTGA TTGGCAATGC GTTTAAATGT CGTTTTAAAA 1260 

GTTC TTATGT TGAATATTAT TTTTTTAAGT CTCTCGATTA GTTTGTCATC AATCTTTTTT 1320 

CGAGACATGG GCCTTTTGAT TCAATCGGCG GNTTCCGTGT TATCACTGAC AACTTTAGTT 1380 

GTAGCTTCAT CTTTATGTAT TTCTTCGCTA AATCCTTCAA GGTTTTTAGT CGTGGGATTT 1440 

TTAACCTCAG G ATGTTC CAT CATGTCTTTG ACTATCAAGT TCCTTTTTAC ACGTGTCTTT 1500 

ATGGTGATGC TTGATTTGGC GTTCCCTTTT ACTTTTTTTG AATAGTGGGT AGGTATCTGC 1560 

TGCAGCTACT AATTTNCTTC TTACCCTTAA AATAGATATT GGCTTGGAAA CAAAACCAGA 1620 

GTATTTGCAG ATACAAAGTT GCATTAATAC TTAAAGCAAT AATAGCCAAT ACAAATTAAT 1680 

ATTGACACCT NTTGAAATCC CTTCTTTTAA ATTAAGTCAG ATGCCAATAC GATGACAGGG 1740 

TACGGATTGA AAGTATAATT ACAAATATAG AAATTATTGC CGATATAACT ATTGTTACTA 1800 

TTAAATAATC AGCTCTGCTA CCTGATAATA AATAGAAAAG GCGAAAATTA GTCCATAGCA 1860 

AATTACAAAC CCACATAAAG TTATAGCCAT GAGTACTATA TAAGCTATTT GAAAATATAA 1920 

ACC TATCTTT ATGAATGATT TTCTACATTT TTTCCATGTC TATCCCCATT ATTAATATTA 1980 

TACTTACCTA ATATA 1995 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1940 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



40 



45 



50 



55 
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(ii> MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 51: 



70 



15 



20 



25 



30 



40 



45 



SO 



55 



TGT AC TTG AA 


CACTGTACTG 


CGGATGAGTT 


TTTTATTGAC 


ACGGAATTAT 


TAGGGTTCAA 


60 


CGGTGACTTA 


CCACGTTTCG 


TAATTACCTC 


CAAACGACAG 


CTAGTACGTT 


TATAAGATGG 


120 


TGTATCCGTG 


TATTGATCAA 


CATCACTATT 


AGTTAATAAG 


TTAATTGCAC 


C TAG ATC TCC 


180 


ATTTTCCATC 


GC ATC ATT AT 


TTAATGGAAT 


ATAGATTTCT 


TTACCTTTAA 


CACGATCTGT 


240 


CACGTGAACT 


TGTAATACCG 


CTTCTCCTGT 


TTCAGAAATC 


AGCTTAACTT 


CTGCACCTTC 


300 


ATGAATGCCT 


C T ATCTTC AG 


CAAGCTCTGG 


AGAAATTTCA 


ACAAATGCAC 


GTGGC ACTTT 

nJ x wwvov XXX 


360 


GTATTTAATC 


ATTGGTGTTT 


GATAAGTCAT 


ATTACCTTCA 


TGGAAGTGCT 


CTAACAATCG 


420 


ACCATTGTTT 


ACATGAATAT 


CATAAATTTC 


ATCTTGCTTA 


AAGTAATTAT 


CAAATGATAA 


480 


TGGGAATAAT 


TTTGCTTTAC 


CATTATCAAA 


ATTGAATCCT 


TCTAAGTATA 


GAATAGGCTC 


540 


ATCAGTNCCA 


TCCNGNTTGT 


ACTTGNCCAT 


TGT AAAC TAT 


TGAATCCTTC 


TANNCGATTC 


600 


ATAACTTACC 


CCAGCATATA 


GAGGTGTTAA 


GCGTCCTACC 


TTCAATCCAT 


AATTTCACTA 


660 


GGATGCTTGT 


AATTCCAATC 


CAAATCCCTA 


ATCTATTAGC 


AATTGCTTGG 


AAAATTTTCC 


720 


AGTCAGGTTT 


TGAATCACCA 


AGAGGTTCTA 


ATGCTTGGTA 


TAAACGTTGA 


ATACGACGTT 


780 


CGGTATTTGT 


AAAAGTACCG 


TCTTTTTCAA 


GTGAAGGACT 


TGCTGGCAAT 


ACAACATCTG 


840 


CGTATGTTGC 


TGTGAATGTT 


AAAAATTCAT 


CTTGGAC T AC 


CATGAAATCT 


AATTTTTCAA 


900 


ACGCAGCTTG 


TACAAAATTA 


ATATTTGAAT 


CCACAATACC 


CGTATCTTCA 


CCATATAAGT 


960 


ACAATGAGTG 


TACTTCTCCG 


TCATGTATAC 


CTTCTACCAT 


TTCATGATTA 


TCTTTACCAG 


1020 


CTTTTGGATT 


CAATTTAACG 


CCATATTCTT 


TTTCAAATTT 


AGCACGAGCG 


AATATCATCC 


1080 


GCTTCAATAC 


TTTGATAACC 


AGTAATCTTA 


TCAGGCATAC 


TTCCCATATC 


ACTACATCCT 


1140 


TGAACATTAT 


TATGTCCACG 


TAATGGATAC 


GCACCAGTAC 


CAGGACGACG 


ATAATTACCT 


1200 


GTTACTAATA 


ATAAGTTTGA 


AATCGC TGT A 


CTTGAGTCAC 


TACCAATGTC 


TTGTTGTGTA 


1260 


ATACCCATTG 


CCCAACAAAT 


TACAACAGAT 


TCAGCTTTAG 


CACATTCTTC 


AGCAAATTTA 


1320 


ATC AATTC TG 


ATTCAGGAAT 


ACCTGTTGCT 


TCTTCAGCAA 


AAGCCATTGT 


AAATGTTTCT 


1380 


AATGATTTGT 


AATATTCATC 


AAAATCATCT 


ACCCACTCAT 


CAATAAATGC 


TTTATCGTGT 


1440 


AAATCATGAT 


CAATAATATA 


CTTAGTC AC T 


GCACTTAACC 


ACGCTAAATC 


CGTACCTGGT 


1500 


TTAGGTTGAT 


AAAAACGATC 


CGCACGTTCT 


GCCATTTCAT 


GTTTTCTAAT 


ATCAAATACA 


1560 


TGTATTTTTT 


GTCCAAATAA 


TTTTTGAGCA 


CGTTTCATGC 


GTGATGCGAT 


AACTGGATGA 


1620 


GCTTCGGCTG 


TATTGGTACC 


TATCAATACA 


GACATTGCCG 


CTTTTTCTAA 


ATCTTCAATA 


1680 


C TACCTGAGT 


CACCGCCGTG 


TCCAACCGTT 


CTAAATAAGC 


CTTTTGTTGC 


AGGTGCTTGG 


1740 


CAATATCTTG 


AACAGNTATC 


AACGTTATTT 


GTGNCAATAA 


CTTGGTCTTG 


CTTAATTTNN 


1800 


GGGATGNAAA 


TACGATTCTT 


CCATTCCGGC 


CGGTTTTAGA 


AGAAGGAAAT 


GAATTGATAG 


1860 


TGCATCTTGG 


GCCAATACTT 


NNNCTTTNAA 


TAGATGTAAA 


ATTATCTGCA 


AATGACAATT 


1920 


TAAAGGTTCC 


ATCCCATCTT 










1940 



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 



209 



EP 0 841 394 A2 

.(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 



15 



20 



25 



GATTCAACAT 


AACGTCGTGT 


CCTTCAGCAT 


ATATAGTATC 


GAATGCTATG 


GACGATTAAC 


60 


CTGACCCAGA 


TAAACCTGTC 


ATAACAATTA 


ATTTATTTTT 


AGGTAGTTCG 


GATATCAATA 


120 


TCTTTCAAGT 


TATGCGCACG 


AGCACCTTTT 


ACT AC TATGG 


ATGGNTCTTT 


CATTTACTTG 


180 


TCACCCNTCT 


GCTTTTAATT 


CAAATAACAT 


ATCTCTTAAT 


TCNGTAGCTT 


TCTCGAAATC 


240 


TAAATCTTTC 


GCTGCTTGTT 


TCATTTCTTT 


TTCTATATTG 


TCGATTGTCT 


TTNGACGTGC 


300 


TCTTTTCGGC 


ATCTTCTTAG 


GTATCACAGT 


TTGTGCTTNG 


TCATNATTTT 


CGTCATTTTC 


360 


AACAGTAGCA 


CTAATTAAAT 


CATGTACTTN 


TTTATTAATT 


GTTTTAGGTG 


TAATACCATG 


420 


TTTTTCATTA 


TGTTTCATCT 


GTATTTCTCG 


ACGACGTTGT 


GTCTCATCAA 


TTGCATACTT 


480 


CATNGAATCA 


GTCATATTAT 


CGGCATACAT 


AATGANTTCA 


CATTTATCGN 


TACGCTCGTG 


540 


CCGAATCGGC 


ACGAGCTCGT 


ACCTATTGTT 


TGAATTAATG 


AGCGGNTAGA 


ACGTAAAAAT 


600 


CCTTC TTTAT 


CTGCATCTAA 


TATGACAAGT 


AGAGAAACTT 


CTGGTATATC 


AATACCCTCT 


660 


AC TT AATAAA 


TTAATAC CT A 


CGATAACATC 


ATATGTACCC 


ATTCGTAAGT 


CTCTAATTAT 


720 


TTCGATTCGT 


TCGTAATGTC 


TTGATTTCTG 


AATGC AG AT A 


ATTAACTTTA 


ATACCCGCTT 


780 


CTTTCATGTA 


TGTGGTTAAA 


TCTTCACTCA 


TCTTTTTAGT 


GAGCGTTGTA 


ACAAGTACGC 


840 


TCGTGCC 












847 



(2) INFORMATION FOR SEQ ID NO: 2 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 



45 


CCGCTTCGCA 


CGAGCNAAGG 


CTTTTATTGC 


ANCCCTCGCA 


TCTTCTTTAG 


CTTACACTAA 


60 




ATCTTTGATG 


TCTTGTAATG 


CTTGTGCAAG 


TTGTGCTTTG 


GCTTGTTCAA 


TTTCTTCTTT 


120 




AGTCATCGCA 


TTGTTAATGT 


CGTTATGACC 


TTGTTGAAGT 


ATTTGANTAA 


TTCGATCTTT 


180 




AAGCGCTTGT 


TTNTCCTTAT 


CTGTTAGATT 


TGGATTGTGA 


TCGATTTCGN 


CAATT AATGC 


240 


50 


TTGAAC TTGT 


TTATCAACAT 


CTTGTTTCGC 


ATCTTC TTT A 


GCTTACACTA 


AATCTTTAAT 


300 




GTCTTGTAAT 


GCTTGCGCAA 


GTTGTGC TTT 


AGCTTGTTCA 


ATTTCTTCTT 


TAGTCAGCGC 


360 




ATTGTTAATG 


TCGTTATGAC 


CTTGTTGAAG 


TATTTGATTA 


ATACGATCTT 


TTAAGTGCTT 


420 




GTTTTTCCTT 


ATCTGTTAGA 


TTTGGATTTC 


GATCAATTTC 


GTCAATTAAT 


GCTTGAACTT 


480 


55 


GCTTATCAAC 


GTCTTTATTG 


GCATCAATTT 


TNGCTTTNGG 


TATTTCATTA 


GCANGCACTN 


540 
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GTTCAATCGC 


GTGGTTGCCT 


GC TGTTTG AA 




AbL k_ 1 w A X X A 




600 




TATT AATGTT 


GTTGATGATG 


/•■»rTV*/" , »T' , ^T"TV"^/" , 'f" , 

CTGOT 1 IajV-u 


AA i ILi iLl 1 


V?\_ 1 1 AAA 111 


TTCCSGCAATA 


660 


5 


AGCTTGTCTT 


GATCCGTCGC 


ATTTGAAGCT 


TCGATTTCTT 




AvjV.. IAHHuL 1 


T>(\ 
1 z u 




TGATTAATTG 


ATTGAATTGC 


CTTGTCTTTA 


GCATCTTGTA 


GTCGTTGATC 


ACCATTAAGA 


780 




TTATGGATTG 


CATCATTGAC 


TGCTTGGATT 


GCGCCATTGA 


TATCATTCAC 


ATTTGTGTTA 


840 




TCACTATTTA 


GCAATGTATT 


TGCTAGACGT 


TTGGCATCAT 


CGAAGTTTGT 


TTTAGCATTA 


900 


10 


TCGTCAGCGT 


TTTGGTAATT 


GACAGTTTGC 


TCTGCATTTG 


GAATTTCATT 


GTCAACTAAA 


960 




TGTTTCAATG 


TTTCCATTGC 


ATCATTTAAG 


TCAATTTGAT 


TATTAACAAT 


ATCTGTTACA 


1020 




TCTGATACAG 


TATCGGCATT 


GTTAATTGCT 


TTATGTGCAA 


GATCTTGTTG 


CTGTTGATTT 


1080 




AATCCATTTA 


ACGAATTAAC 


AAACGCATTT 


GCTTTATCCT 


TTGGCATTTG 


CAAGGTTTTG 


1140 


15 


GTCTCCATTT 


AATGCATTTT 


GAGCATCGAT 


AATATTTTGT 


TTCAATTGCT 


CTGCTTCAGC 


1200 




TTTTGCAATT 


GCATTACTTG 


CACTTTATCT 


AACTCATGTG 


CTGCATTCGC 


AATGCATATC 


1260 




AT AATTTGC T 


TTCAATGTCA 


TCTGCATGAT 


GTATTGTGCT 






1300 



(2) INFORMATION FOR SEQ ID NO: 254: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 54: 



35 



40 



45 



50 



55 



CTTTTCATCT 


CTTCGACCTG 


TGTAATACCT 


TTGATCAGCT 


GCACAAATAG 


ACAAGTATAA 


60 


AGGTCTTACG 


ATGACATGAT 


TACCATAAAT 


ATCAACATTA 


TTATATGTGA 


CGTCGAACTG 


120 


TCTCGGTGCA 


ACGAGTTGAT 


ATACTTGATT 


AATCATCGGC 


AATATCACCT 


TGAATAATGG 


180 


NATTTGCTAC 


TTTNAANTCA 


TNCGGGGTTG 


TCACTTTAAT 


GTTGTATAGT 


TCTCCACGTA 


240 


CCAATTTAAC 


TGCATGTCCA 


GATTCGACAA 


TTATTTTACA 


TGCATCTGAT 


AAGATTTCTT 


300 


TNTGGTTCAC 


TACTTAAGGC 


GCGATAACTA 


TCTTGTAATA 


ATTTAATATT 


AAATGATTGT 


360 


GGTGTTTGGC 


CTTGATACAT 


TTCATTCCTT 


ACAGGGATAC 


TGTGTATGTT 


CTGTTTATCT 


420 


TTAGACATTA 


CAATCGTATC 


AATTGCTTCA 


ATGACTGTAT 


CTACTGCACC 


ATATTNNGCT 


480 


GCTACTTCAA 


TGTTCTCTTT 


AATAATACGT 


TGAGTTAAAA 


ATGGTCTTAC 


GGCATCATGA 


540 


GTTACAATCA 


CATCATCATT 


ATTAATTCCA 


TTTACATTGC 


GAATATGGTC 


GATAATGTTC 


600 


ATAATTGTTT 


CGTTTCGATC 


CGNACCACCT 


GCAACTACTT 


TGACACGTTG 


ATCTGTAATG 


660 


TTATATTTTT 


TTAAAATATC 


CTGTGTATGG 


GAAATCCACT 


GTGCTGGCGT 


TGCGATAATA 


720 


ATCTCATTAA 


ATTCACTCAC 


TAAAATGAAC 


TTCTCAATTG 


TATGGATTAA 


AATCGGTTTA 


780 


TTATCAATAT 


CTAAAAATTG 


TTTTGGGTAA 


TGGCACGTTC 


CCCATTCTCG 


AACCAATACC 


840 


TCCTGCTAAA 


ATACCTGCAT 


AAATCATGTT 


GGCCTCCATC 


CTGTCATTAC 


ATC ATTTCC A 


900 


TTTATACATT 


ACTGACCTAT 


GCCCGCACAT 


AAGCCTAACC 


TATTGCTCAC 


TNGNC TCTTN 


960 


TATTAATCCA 


AAGATAGTTG 


TCACAATAGT 


GTGATAATTN 


TTTATAAAAA 


TGTATTTNTG 


1020 


NNACTGACCA 


NTCTAAGTTG 


TTTTGGCATG 


CAGCTAATCA 


TTAAC TCTG A 


CGATATTAAA 


1080 
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TTGTTAAAGG TATTAATGTT TACTCTTTC A CAAATTCATT ATTACTGCCA TCATTTNNCC 114 0 

ATATATTATA ATAAATTTAT CTTATTAAGT GGCTGNACTT GATTTTC AC T TTAAAAATTA 12 00 

5 TCAAATATTG CCATCTCATT TTAAGNATAC AAAATGCAAA ACAACCGATT CACAAGCATA 12 60 

TTTCACACAA GTAAACCGGC TATTTATCAA CGTATATTCG AAGATGAATT ATTTCGATAG 13 20 

TATCTTGTAG AC C AGNCGGC ATTCGCACTT TCAATAGCNT ATT AAC TT AT ACCAGNGGTT 13 80 

TCGTCCTCNA NGGTGCATAC TAATAAATCG TAAACNTGAC TTTAG 1425 

10 

(2) INFORMATION FOR SEQ ID NO:255: 



(i) SEQUENCE CHARACTERISTICS: 

T5 (A) LENGTH: 3160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 



30 



35 



40 



45 



50 



55 



TTTCTTACCT 


TTAATAATTG 


GTAATTCATC 


TTCCATCCTC 


TTGTTTGATC 


TGTGTCGCAT 


60 


AAGCTTCATC 


AAATAGTACA 


TCAGTACGTG 


GAACACCTGT 


AGGCACTACA 


TTTTTCTCTT 


120 


TAATACCAAA 


TGCTTCAGCG 


TAGAATGGAA 


TATCGGTTTC 


AGATGATACA 


TAAGCTTTGG 


180 


CACGAGCTAC 


GATGATTTAA 


TGAATCAATA 


AATGGTCCAC 


CCTTTTTACC 


AGTACGACTA 


240 


AAGCCAACTG 


TTTTAAAGGC 


ACCAACGGCA 


TGCCATACTT 


GAATAACTTC 


TTGAGAACGT 


300 


C T AAAACGC A 


CTGTATAAAT 


CAAGGGGTGA 


AAGTCATCAA 


CAAAGATGTA 


GTCTGCCTTC 


360 


CCAAGTAAAT 


ATGGCAATCT 


AAACTTGTCG 


ATGATGCCAC 


GTCTATCTGT 


AATATTCGCT 


420 


TTAAAAACAG 


TGTGAATATC 


ATACTTTTTA 


TCTAAATTTT 


GACGTAACAT 


TTCGTTATAG 


480 


ATGTATTCAA 


AGTTTCCAGA 


CATCGTTGGT 


CTAGAGTCTG 


ATGTGAACAA 


CACCGTATTC 


540 


CCTTTTTTCA 


AGGGTGAAAA 


ATTTCGTCGT 


ATTAAATATC 


GCNTNAAAAA 


TAAATTGTCT 


600 


TTGTTATTAA 


ATGAATGTTT 


GCGG AAATAC 


NTACGTAATT 


CTTTAATATT 


TACGAACGAA 


660 


AATAAATACT 


TTTAACTTCC 


GGAGTCGTTA 


CNACCAACAT 


CAAGGACAAA 


TTCATTAACA 


720 


TTCGCNAGAA 


ATTTCAGGGT 


GTAACAGTAT 


AAACCGTTTT 


CTTCGAAATG 


CCGCCTTTTC 


780 


TAAATTCTTT 


TAGGTAAGTC 


TGCAATAAGA 


AATTGATTTT 


ACC ATTTTG T 


GTTTCTAATT 


840 


CGTTGTATTC 


TTCTTCTTGT 


TCTGGCTTTA 


GATTTTGATA 


TGCATCATTA 


ATCACATCTG 


900 


GGTTTANCTG 


TGCAATATAA 


TCAAGTTCTT 


GCTCATTCAC 


TAATAAGTAC 


TTATCTTCAG 


960 


GTAAGTAATA 


ACCATTATCT 


AAGATAGCTA 


CATTGAAACG 


ACAAACGAAT 


TGATTCCCAT 


1020 


CTATTTTGAC 


ATCATTCGCC 


TTCATTGTAC 


GTGTCTCAGT 


TAAATTTCTT 


AATACAAAAT 


1080 


TACTATCTTC 


TAAATCTAGG 


TTTTCACTAT 


GTCCTTCAAC 


GAATAACTGA 


ACACGTTCCC 


1140 


AATAGATTTT 


ATCTATATAT 


ATCTTACTTT 


TANCCAACGT 


TAATTCATCC 


TTTTCTATTT 


1200 


ACATAATCCA 


TTTTAATACT 


GTTTTACCCC 


AAGATGTAGA 


CAGGTCTGCT 


TCAAAAGCTT 


1260 


CTGTAAGATC 


ATNAATTGTT 


GCAATTTCAT 


ATTCTTGACC 


TTTTAACAAC 


GCTAATTTAT 


1320 


C T AC AATATC 


TGGGTATTGA 


ATGTATAAGT 


CTACAACATC 


TTGGAAATCT 


TGTGACCCAC 


1380 


TTCGACTACT 


ACCAATCAAC 


GTTAG TCCTT 


TTTCCAATAC 


TAGACGTGTA 


TTAACTTCTA 


1440 


CTGGGAACTC 


ACTTACACCT 


AACAGTGCAA 


TGCTTCCTTC 


TGGTGAAATA 


TANTCAATCA 


1500 
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35 



TTTGATTTAT 


TGCTGATTGG 


NTACCACGCC 


CACCAACGCA 


CTCAAATGCA 


TGATCANATG 


1560 


TTAAGCCTTC 


AGGTATTTTA 


TTAATAAAGA 


AGACATCATC 


ft ft **> * k « m/^ ft r** 

AAC AAATG AG 


ft ft i^*m^* m am/h -» 

AAGTGACTCA 


1620 


ATTTATAATC 


TTGGTNNACC 


AAAGACATAT 


ft rTT/^^ftT ft /"* ft /"Nrtl 

ATGGNAGACT 


TCTGGATATT 


ft * m KY /*> ft /*"*/"^/"^ ft 

AAT NG ACGGA 


1680 


ATAAAATGGC 


TGTAATGTAA 


CCTAAGTTAC 


CATCACCCCA 


AATACCAAAT 


GTATTTTTAT 


17 4 0 


TTGAAATAGA 


TTTCTTTTCA 


AC AACG ACG A 


AT AGC ATGC A 


AACTTACTGT 


TACAAGCTCT 


i o n i\ 
18 00 


GTAGATGAAA 


T AAT AC TT AA 


ATCAATATCA 


TC AGG T AGTG 


GTACAGCTCT 


ft mr* ft nrw*> ft mfT m 

ATC ATG ATT T 


1860 


AGCAACACAA 


AATCTTGCAT 


AAACCCATCA 


TG i CCACTTG 


ft ffV^fTV^ ft ft <""«fT*ft 

ATC TG AAGT A 


GC TCG ATTTT 


T Q T A 

iy z u 


AAATAGTTTT 


CAGCAATGAC 


ATCGTCTTTT 


TCTGTCGGCG 


m ft n^rr*/""*/"**/"^ rn> ft 

TATTCGGTAC 


^ ft. rri ft ft ft rn 

C AT AACTAC T 


1 o a n 


TTTGTACCTT 


TATTAAACAC 


AC CTTT AC TG 


rrv» ft » » *n ft /*•/"■"■ * 

rC AAAT AC GA 


✓> rr>/^ ft y*** ft ft ^ 

CCTCACCAAC 


ft **f Ttnri^ ft rrV~* ft 

ACCTTCATGA 




ATTAAAGACA 


TTGGCAATTT 


C TGAGAT AAG 


ft ft mmy^m/"* ft m 

ACATTCTCAT 


CACGGCTACC 


ft /™*nn t\ m ft ft m ft m 

AGTATAATAT 




CTTTGATCGG 


C AGC AC AAAT 


TG AC AT AT A I 


ft ft ft /"■■/** ft ✓"V , f< ft 

AAAGGACG I A 


ft ft mT 1 ft /** ft T ft 

CAATG AC A I A 


/~>n~i/~i ft /^>iTV~ , T l ft ft 

G 1 GAG 1G 1 AA 




ATATCTACGT 


TGTTATACGT 


AAC TTC AAAT 


I Gl CTAGGTG 


i - * * ft /">n\ ft jv mir><^ 

CAAL TAAI lb 


ft T> ft rr> ft /"•mm/"' ft 

A I A TACTTGA 


o o o n 


TTAATCATCG 


GCCAATACCA 


/"*/"*mj*"*^ ft ft m ft ft 

CC TCGAATAA 


1 AGGATTCGC 


TACTTTTAAA 


ILLj i aagg 1 Vj 




TTGTTACTTT 


ft % m/^mm * m it rn 

AATGTTATAT 


AACTCACCTT 


M ti I • ft ft ft /~*ffi /"»/"» 

1 TALAAGl CG 


AACCGGTTTG 


TTTGTTTCTA 


^ J ft \J 


CAATAATCTT 


ACAAGCATCA 


ft nn ft ft ft ft m) x 

GATAAAATAC 


TCTTTTGC TC 


ft fPP ft OT»/^ ft ft f"* 

ATCACTCAAL 


TV^TV^/^ ft ft f~±f~* 

1G 1GGA TAGG 


o /i n r\ 


TTTCTTTTAA 


rr» ft ft ft nrtrn ft ft rti ft 

TAAATTAATA 


TTAAACGAT 1 


/■^ ft /"V— rrv~" TOT**"!* 1 /"*" 

GAGG 1 vj I 1 rG 


ft. /~* TiTV^f f A 1""" 

AGO. 1 1 GG 1 


ATTTCATTAC 


7 4 fi fi 


GLAL rGGAAT 


fn/^p ft rp/^> » ft nv^ 

TGLATCAATC 


/~*iTinr\rn/~> ft mm ft rp 

GTTTGA1 1A1 


Gill AGA 1\j I 


ft ft f ft ft <TVT"" r Pft 

AALAAlVbl A 






CAATC AC TGT 


ATCTACTGCA 


CCGTACTCTA 


AAGCAGCTTG 


AATATTTTCT 


TTAATAATAC 


2580 


GATGCGTTAA 


AAATGGTCTA 


ACTGCATCAT 


GTGTCACAAT 


GACATCGTCA 


TCGTTAATAC 


2640 


CATTTGTTGA 


TTCAATATGT 


TTAACGATAT 


TCATAATTGT 


ATCGTTACGA 


TCGCTACCAC 


2700 


CTTGAATGAC 


TTCAATTCTT 


TCATCAGAAA 


TTTTGAATTT 


TC TAAGTGT A 


TCTTTCGTAT 


2760 


GCGTCATCCA 


TTGTTGTGGC 


GTCGCGATAA 


TAATTTTTTC 


AAAATCATTA 


ATTAAAATAA 


2820 


ATTTTTC TTA 


ATGTATGGAT 


TAAAATCGGN 


TTGTTGTCTA 


AATCTAAAAA 


TNGTTTAGGT 


2880 


AAAGGGTACG 


GTTACCCATT 


CCTTGAGCCT 


ATACCTCCAG 


CTAGAATACC 


AGCGTATTTC 


2940 


ATAAAATACT 


TCCTCCATTC 


AACTATATCT 


ATATTTAATT 


ATTTAAATTT 


CGTTGCATTT 


3000 


TCCAATTGAA 


AACTCATTTT 


AAAATCAAAA 


CTCTAAATGT 


CTGTGTATTA 


CTTAAAATTA 


3060 


TACATATTTT 


GCTTATATTT 


TAGCATATTT 


TGTTTAAACC 


TATATTACAT 


TATATCAGAC 


3120 


GTTTTCATAC 


ACAAATAATA 


ACATACAAGC 


AAACATTTCG 






3160 



(2) INFORMATION FOR SEQ ID NO: 256: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

45 

(D) TOPOLOGY: linear 

<xi) MOLECULE TYPE: Genomic DNA 

SO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 



AAAAAAACAA TTTANCCCAT NACNACTTGT CGTTAATTAT TCATACGAAA TACATGATTA 60 
ATGTACCACT TTAACATAAC AAAAAATCGT TATCCATTCA TAACGTATGT GTTTACACAT 120 
TTATGAATTA GATAACGATT GGATCGATTA TTTTATTTTA CAAAATGACA ATTC AG TTGG 180 
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5 



10 



15 



20 



25 



30 



AAGGTGATTG 


CTTTTGATTG 


AA 1 CGLL 1 iA 


IVjrAAAA 


A 1 LAnArvub 1 


1 A 1 i L 1 A 1. 1 


240 


Ol nlnvjlLL I 


GO. 1TCTCATC 


TV rp^ jv O TV rp/TPf 


1 Lnt 1 1 LA 


1 Ivjl V, AU/VAV, 


LL 1 J- L 1 lvj^V\ 




* TV i* vn Jv rTVT" , /"""'P 


I A TGACTCAT 


1 lOLAiljOLA 


A PIT A A TAT IT 
r\\j 1 AA LAI Al 


LtLLAAV.nl 1A 


nr^n r Tf~"T'Tk a hp 

LrL blL 1 AAAL 


? n 


* TV IV TV »rpi I'fwT/^" TV 

AAATCT rNGA 


LTAAACGTTC 


AC 1 WvjAoUjA 


L LA 1 L ioun 1 


TVfp«pm IV * TV 7VVTO 

A 1 1 1 AAAAJNf \j 


111 ALtL 1 AAVJ 


•a Z U 


» IV rrvym TV A 7V 


CTTTTTCAAC 


L 1 LAIAA 1 L 1 


1 bAa 1 va 1 LLA 


AAvjLA 1LLA1 


1 AA luLA 1 LA 


fi O VJ 


AAGGATTGTA 


o tv 7v mumwi tv /"•»/■* 

CAATTTT AC C 


rri/-»^~» A TV TV A A *T* 

TGGAACAAA I 


VjA i L. LA 1 A 1 Vj 


PTTP1T1P1 A 
ul 1 LA 1 AvjAA 


A IVnLvjLvj l L 


J-iu 


/">f"n 71 TV ftV'HI 7V TV rTI 

GTAATGTAAT 


C TTCT AAG TC 


AAA TGC A J AG 


A A A A TP A TV/"' 
AAAA 1 LA 1 LL» 


pjnmmBvpif AAA 

L*L 1111 1 AAA 


TAPTPSPlTaT 
1 AL lyLA 1 A 1 


ouu 


TCATATATTA 


X TV /*■* X •"P/"*" TV JV TV 

AAGATGAATA 


ATC AC I AA 1 C 


AAC AAG 1 C l\3 


Tiipiaipup 
1 AALAAAbAvi 


a iTarrpPTTl 

AA 1 A 1 V_Vj 1 1 A 


^ n 


TV /"»*TVTV"» TV /~*f TV fP 

AC 1 TCACGAI 


p tv rprTTV*^ tv c* tv p 
liAI C IXjAC AC 


Ji UTT 1 JV m TV TV TV f* 

A 1 \^.vjA l AAAb 


1"* A TTT" TV*" m A T* 


PTT»P A rVTITYZ A 
Lt 1 ILALuIAjA 


AA 1 A 1 1 AAvJ 1 


79 Pi 


CTATTTTTTA 


r*T* jv tv /T*r+ tv mo 
CG AAC GGATG 


X7 jv rrvrrvprp/"" jv A T 
PJA1 i J. 1GAA1 


jv IPiril OTP 
AALALAAL 1 vj 


WA 1 1 A 1VL 1 1 


L 1 \_Ltv_ ALtOA 1 


'OU 


r^rY*-"T'<"""/'"'f"n TV TV TVO 

CTTGC rAAAL 


G 1 TLAAAA I C 


AATTTTAAAA 


A BWyiTl JT 
AA IVuA 1 AA 1 


Vj luLLu 1 ALL 


ryiY^appiTTJi 

UluALLnl 1A 


ft a. r> 


/-^•/-VT-/~Tp TV TV 7\ TV"" 

CCT C I AAA1\j 


ILviG 1\jLvxAA 


1 AvjAA i AAL I 


TTPTT A rrTT 
llv.ll AL v.11 


1 Inlnni iVjLj 


rAATTPATrT 

V_r-VrA J. X >— r\ i. V, 1 


900 


TCC ATC TC TT 


ul 1 1AA 1 lib 


L\3 1 luCAlAA 


/ywr A TV A A 
L-L 1 1 V.A ILArt 


A 1 Av3 1 ALA 1 L 


jfiVJ X rtV_vj X VJVJ X 


960 




vj 1 ALAAL A l I 


llvl IV. i V i A 


i- J. > — V».fVrVr\ 4. VJ 




AAATOGAATA 


1020 


TCGGGTTTCT 


*■ rtv^ tv ti * tv m 

GATGAAACAT 


ATGCTTTCGG 


rTVTnTV TV 0/T> TV 

TGTAACTACG 


ATGGGTTTTA 


ft TV*' ft ft ' 1 V '/**' 7A «T» 

ATG AATCGA 1 


i d p» 


AAACGGACCA 


CCTTTTTNAC 


CTGTACGACT 


AAAGCCAACT 


GTTTTAAAGG 


CACCAACAGC 


1140 


ATGNCACACT 


TGAATAATTT 


CTTGTGATGG 


TCTAAAGCGA 


ACCGTATAAA 


TTAATGGATG 


1200 


GAAATCATCA 


ACAAAAATAT 


AATCGGCCTT 


AC C AAGTAAA 


TATGGCAATC 


TAAAC TTGTC 


1260 


TCTCCATTTG 


CGTCTATCCG 


TAATATTCGC 


CTTAAATACC 


GTTTTAATAT 


CATAATCAAA 


1320 


ATCTACTTTG 


TGGCGTAGTA 


ACTCATCATA 


TACATACTTG 


AAATTCCCTG 


ATAAATTCGG 


1380 


ACGCGAATCT 


GATGTGAATA 


ATATTGTTNT 


GCTTCTTTTA 


ATATGTAGTA 


ACTTTGTAAT 


1440 


ATTAAAAATA 


GCTTTAAATA 


AGAAACTTCT 


AC TTTC AAAT 


GAAGCTTTAT 


GACCTTGTTT 


1500 


ATGAAGCCAG 


TGTGC AC TTG 


GCGCAATGAC 


CCTGATTTCT 


CTTGAGGTAA 


GGTGATTTCA 


1560 


ATATCAAATA 


CAAATTCGGN 


T AAC CGTCAC 


TTGGCTATCT 


CCGGAGTAAT 


GGTATTAGAC 


1620 


CGTATGTTGT 


GATACGCCAC 


CTTTACGGAA 


AACTTTAGCA 


TCATACGCTA 


ATAAAG 


1676 



(2) INFORMATION FOR SEQ ID NO: 257: 

35 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 



50 



55 



GATTCGAGCT 


CGGTACCGGG 


GATCCTCTAG 


AGTCGACCTG 


CAAGCTATAC 


CAAGCTAGAG 


60 


TACTGCGACG 


CAAACTAGTG 


CAACACCTAA 


GGCTGTTACA 


GCTTGGAAGT 


CGATAAAATA 


120 


GTTGAAAAAG 


AGCAAGCGAT 


TTTTGAACGT 


GTTAACGAAG 


TTGAACAACA 


AATCAATGGC 


180 


GCTG AC CTTG 


TTAAAGGTAA 


TTCAACAACA 


AATTGGCAAA 


AGTCTAAACT 


TACAGATGAT 


240 


TACGGTAAAG 


CAATCGAATC 


GTCTGAGCAG 


TC C ATAGATA 


GCGTTTTAAG 


C AC AGTTAAC 


300 


AC ATC TAGG A 


TTATTCATAT 


TACTAATGCA 


ACAGATGCGC 


CAGAAAAGAC 


GGATATAGGC 


360 
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ACGTTAGAGA AGCCTGGACA AGATGGTGTT GATGACGGTT CTTCGTTCGA TGAATCAACT 420 

TATACATCAA GCAAATCTGG TGTGTTAGTT GTTTATGTTG TTGATAATAA TACTGCTCGG 4 80 

TGCAACATGG TACCCAGACG ATTCAAACGA TGAGTTCACA AAATACAAAN TCTTACGGCA 540 

CATGGTACCC GTTTTATTAA AAAGAATGAT GGAAACTTAA CTAGCAATTT GGTTGGAGGA 600 

ACGTCTTACA ACGCTTTAAA TCAAGCTAAG CAGTATGTAG ATGATAAATT CGGAACAACG 660 

AGCTGGCAAC AACATAAGAT GACAGAGCCG AACGGTCAAT CAATTCAAGT TAACTTAAAT 720 

AATGCGCAAG GCGATTTGGG ATATTTAACT GCTGGTAATT ACTATGCAAC AAGAGTGCCG 780 

GATTTACCAG GTAGCGTTGA AAGTTATGAG GGTTATTTAT CGGTATTCGT TAAAGATGAT 840 

ACAAACAAGC TATTTAACTT CACACCTTAT AACTCTAAAA AGATTTACAC ACGATCAATC 900 

ACAAACGGAA GACTTGAGCA ACAGTGGACA GTTCCTAATG AACATAAGTC AACGGTATTG 960 

TTCGACGGTG GAGCAAATGG TG TAGGT AC A ACAATCAATC TAACTGAACC GTACACAAAC 1020 

TATTCTATTT TGTTGGTAAG TGGAACTTAT CCAGGTGGCG TTATTGAGGG ATTCGGACTA 1080 

ACCGCATTAC CTAATGCAAT TCAATTAACC AAACCGAATG TAGTTGACTC AGACGGTANC 1140 

GGTGGCGGTA TTTATGAGTG TTTACTATCA AAAACAAGTA GTACCCACTT TAAGAATAGA 1200 

CAACGATGTG TATTTCGATT TAGGCAAAAC ATCAGGTTCT GGAACGAATG CCAACAAAGT 1260 

TACTATAACT AAAATTATGG GGTGGAAATA ATGAAAATAA CAGTAAACGA TAAAAACGAA 132 0 

GTTATCGGAT ACGTTAATAC TGGCGGTTTA CGCAATAGTT TAGATGTAGA TGATAACAAT 1380 

GTGCCTATCA AATTCAAAGA AG AG TTTG AA CCTAGAAAGT TTGTTTTCAC TAACGGCGAA 1440 

ATTAAATATA ACAGCAATTT TGAAAAAGAA GACGTACCGA ATGCATCAAG CCAACAAAGT 1500 

GAATCAGATT TGAGTGATGA AGAACTTCGC GG AATGGTTG CGAGTATGCA AATGCAGGTG 1560 

GCACAAGTAA ACGTATTAAC AATGGAATTA GCTCAACAAA ACGCTATGTT AACACAACAG 1620 

TTGACTGAAC TGAAAACTAA CAAAACAAGT ACTGAGGGGG ACGTTTAATG ATGAAGATGA 1680 

TTTATCCGAC TTTTAAAGAC ATCAAAACTT TTTATGTTTG GGGTTAC TAT AAAAACGAGC 1740 

AAATTAAGTG GTACGTAGAC AAGGGTTTAA TCGATAAAGA AGAATACGCT TTAATCACTG 1800 

GAGAAAAATA TCCAGAAACA AAAGATGAAA AGTCACAGGT GTAATGCTTG TGGCTTTTTA 1860 

ATTTAACAAA AAGTAGGTGG CGTAATGTTT GGTTTTACCA AACGACATGA ACAAGATTGG 1920 

CGTTTAACGC GATTAGAAGA AAATGATAAG ACTATGTTTG AAAAATTCGA CAGAATAGAA 1980 

GATAGTC TT A GAGCGCAAGA AAAGATTTAT GACAAATTAG ATAGAAATTT TGGAAGAATT 2040 

AAAGCGCGAC AAGGTAGAAG ATGAAAAGAA TAAAGAAAAG AATGCCAAGA ATATTAGAGA 2100 

CATAAAAATG TGGATTCTCG GTTTGATAGG GAC TATCTTC AGTACGATTG TCATAGCTTT 2160 

ACTAAGAACT ATTTTTGGTA TTTAAAGGAG GTGATTACCA TGCTTAAAGG GGATTTTAGG 2220 

ATATAGCTNC TGGGCGGTCC TCCTGGTTGG GGG TAAATGT AAATAACAGT TAAGAGTCAG 2280 

TGC TCCGGC A CTGGCTTTTT ATTTTGATTG AAATGAGGTG CATACATGGG ATTACCTAAT 2340 

CCAAAGACTA GAAAGCCTAC AGCTAGTGAA GTGGTGGAGT GGGCAAAGTC GAATATTGGT 2400 

AAGAGGATTA ATATAGATAA TT ATCGGGGC AGTCAATGTT GGGATACACC TAACTTTATT 24 60 

TTTAAAAGAT ATTGGGGTTT TGTAACATGG GGC AATGCTA AGGATATGGC TAATTACAGA 2 520 

TATCCTAAGG GTTTCCGATT C TATCGTT AT TCATCTGGAT TTGTTCC GG A ACCCGGAGAC 2580 

ATAGCAGTTT GGCACCCTGG CAACGGAATA GGTTCGGACG GACACACCGC AATAGTAGTA 2640 

GGACCATCTA ATAAAAGTTA TTTTTATAGC GTTG AC C AAA ACTGGGTTAA TTCTAATAGT 2700 

TGGACAGGTT CTCCAGGAAG ATTAGTAAGA CACCCTTATG T AAG TGTTAC AGGCTTTGTT 2760 

AGGCCTCCGT ACTCAAAAGA TACTAGCAAA CCTAGTAGTA CTGATACAAG TTCAGCATCA 2 820 

AAAAGCCAAT GACTCAACAA TTACTGGCGA AGCGAAGAAA CCGCAATTTA AAGAAGTTAA 2 880 

AAC AG TAAAA TACACTGCTT ACAGCAATGT TTAGATAAAG AAGAGCACTT CATTGATCAT 2940 

ATAGTTGTAA TGGGTGATGA ACGCTCAGAT ATTCAAGGAT TATATATAAA AGAATCAATG 3000 
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CATATGCGTT CTGTAGACGA ACTTTATACG CAAAGAAATA AG TTTAT AAG CGATTATGAA 3060 
ATACCGCATT TATATGTCGA TAGAGAGGCT ACAGGAATTC CGGAATTCCG 3110 

5 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 



25 



30 



35 



40 



45 
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AAATTATATA 


TAATAACGTC 


GTTTTAACTA 


AGGCAACATA 


AGGAGGTGCG 


TCAATTGGCT 


60 


CAAGCAAAAA 


AGAAATCGAC 


AGCTAAGAAA 


AAAACAGCAT 


CAAAAAAAAG 


AACAAATTCA 


120 


AGGAAAAAGA 


AGAATGATAA 


TCCGATACGT 


TATGTCATAG 


CTATTTTAGT 


AGTTGTATTA 


180 


ATGGTGTTGG 


GTGTTTTCCA 


ATTAGGAATA 


ATCGGTCGTC 


TAATTGACAG 


CTTCTTTAAT 


240 


TATTTATTTG 


GGTACAGTAG 


ATATTTAACA 


TATATTTTAG 


TACTCTTAGC 


AACTGGTTTT 


300 


ATTACATACT 


CTAAACGTAT 


TCCTAAAACT 


AGACGAACGG 


CTGGTTCGAT 


TGTATTGCAA 


360 


ATTGCATTGC 


TATTTGTATC 


ACAGTTAGTT 


TTTCATTTTA 


ATAGTGGTAT 


CAAAGCTGAA 


420 


AGAGAACCTG 


TACTTTCTTA 


TGTATATCAG 


TCATACCAAC 


ACAGTCATTT 


TCCAAATTTT 


480 


GGTGGCGGTG 


TATTAGGTTT 


TTATTTATTA 


GAGTTAAGCG 


TACCTTTAAT 


TTCATTATTT 


540 


GGTGTATGTA 


TTATTACTAT 


TTTATTATTA 


TGCTCAAGTG 


TTATTTTATT 


AACAAACCAT 


600 


CAACATCGTG 


ATGTTGCAAA 


AGTTGCACTG 


GAAAATATAA 


AAGCTTGGTT 


TGGTTCATTT 


660 


AATGAAAAAA 


TGTCGGAAAG 


AAACCAAGAA 


AAACAATTGA 


AGCGTGAAGA 


AAAAGCGAGA 


720 


CTTAAAGAAG 


AACAAAAGGC 


ACGTCAAAAT 


GAACAGCCAC 


AAATAAAAGA 


TGTGAGTGAT 


780 


TTTACGGAAG 


TGCCTCAAGA 


AAGAGATATT 


CCAATTTATG 


GGCATACTGA 


AAATGAAAGT 


840 


AAAAGCCAGT 


GTCAACCAAG 


TCGAAAAAAA 


CGAGTGTTTG 


ATGCAGAGAA 


TAGTTCGAAT 


900 


AACATCGTAA 


ATCATCAAGC 


AGATCAGCAA 


GAGCAATTAA 


CAGAACAAAC 


TCATAACAGT 


960 


GTTGAAAGTG 


AAAACACTAT 


TGAAGAAGCT 


GGTGAAGTTA 


CGAATGTATC 


GTATGTTGTT 


1020 


CCACCGTTAA 


CTTTACTTAA 


TCAACCTGCA 


AAACAAAAAG 


CAACATCTAA 


AGC TGAAGT A 


1080 


CAACGTAAAG 


GACAAGTACC 


AAAGAGATAC 


ATNAAAAGAT 


TTTGGGGTNA 


AATCCNAAAG 


1140 


TNGACACAAA 


TNAAATTGTC 


CTNCAGTAAC 


TCAATATGAA 


ATCCCACCCA 


GCTCAANGGG 


1200 


GGTTNAAAGT 


GAGTAAAATT 


GTAAACTTTG 


CATAATGATA 


TTGCATTAGC 


TTTAGCACCA 


1260 


AAAGATGTTA 


GAATCGAAGC 


ACCAATACCT 


GGTCGC TCTG 


CAGTAGGTAT 


TGAAGTGCCA 


1320 


AATGAGAAAA 


TTTCATTAGT 


TTCACTAAAA 


GAAGTTTTAG 


ATGAAAAATT 


CCCGTCTAAT 


1380 


AAT AAAC TAG 


AAGTTGGATT 


AGGAAGAGAT 


ATATCAGGTG 


ATCCAATTAC 


TGTTCCACTA 


1440 


AATGAAATGC 


CACACTTATT 


GGTGGCAGGA 


TCGACGGGTA 


GTGGTAAATC 


TGTTTGTATA 


1500 


AATGGTATTA 


TTACAAGTAT 


TTTATTAAAT 


GCTAAGCCGC 


ATGAAGTTAA 


ACTTATGTTA 


1560 


ATCGATCCGA 


AAATGGTTGA 


ACTAAATGTT 


TATAACGNNN 


NNNCACATTT 


ATTAATTCCG 


1620 


GTTGTTACAA 


ATCCTCATAA 


AGCTGCTCAA 


GCTTTAGAAA 


AAATTGTAGC 


TGAGATGGAA 


1680 


AGACGTTATG 


ATTTATTCCA 


ACATTCATCA 


ACTAGAAATA 


TTAAAGGTTA 


TAACGAATTA 


1740 
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5 



10 



15 



ATCCGTAAGC 


AAAATCAAGA 


ATTAGATGAG 


AAGCAACCAG 


AATTACCTTA TATCGTTGTT . 


1800 


ATTGTAGATG 


AGCTTGCAGA 


TTTAATGATG 


GTAGCTGGTA 


AAGAAGTTGA 


AAATGCGATT 


1860 


CAACGTATCA 


CACAAATGGC 


ACGTGCAGCA 


GGT AT AC ATT 


TGATTGTAGC 


AACACAAAGA 


1920 




ATfSTfc.ATTA.C 
niul nA J. l r\ V— 




AAAAATAACA 

********** *■ •VfcVO 


TTCCATCTAG 


AATTGCTTTT 


1980 


GCTGTGAGTT 


CTCAAACAGA 


TTCAAGAACT 


ATTATTGGTA 


CTGGCGG CGC 


AGAAAAGTTA 


2040 


CTTGGTAAAG 


GGTGACATGT 


TATACGTTGG 


AAATGGTGAT 


TCATCACAAA 


CACGTATTCC 


2100 


AAGGGGCGTT 


TTTAAGTGAC 


CAAGAGGTGC 


AAGATGTTGT 


AAATTATGTA 


GTAGAACAAC 


2160 


AACAGGCAAA 


TTATGTAAAA 


GAAATGGAAC 


CAGATGCACC 


AGTGGATAAA 


TCGGAAATGA 


2220 


AAAGTGAAGA 


TGCTTTATAT 


GATGAAGCGT 


ATTTGTTTGT 


TGTTGAACAA 


CAAAAGGCAA 


2280 


GTACATCATT 


GTTACAACGC 


CAATTTAGAA 


TTGGTTATAA 


TAGAGCATCT 


AGGTTGATGG 


2340 


ATGATTTAGA 


ACGCAATCAG 


GTAATCGGTC 


CACAAAAAGG 


AAGCAAGCCT 


AGACAAGTTT 


2400 


TAATAGATCT 


TAATAATGAC 


GAGGTGTAAA 


AAATTTGATT 


AAGCAAAATA 


AATTGGAATA 


2460 


TGGTGACCAA 


CTATCCAAGT 


AATTTATCAA 


TTGCCAGAGA 


ATTAAATGTA 


AAAACCGACG 


2520 


ATGTTTATGA 


AGCAATTCAG 


GCATTGATTA 


CTG 






2553 



20 

(2) INFORMATION FOR SEQ ID NO: 2 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1664 base pairs 

25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



40 



45 



SO 
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CCGTACATTA 


ACAACCTGGA 


TTAATTACAG 


ATTACGTTTT 


GTCTCATCTA 


AAAATTGTTG 


60 


ATAATCTTTG 


ATCAATTTCC 


AAATCTGTAC 


GTATGTATTA 


TCTTAGTAGT 


AGTTTAATAT 


120 


GCACTAAATA 


CTTGCAATTG 


ATATTTACCA 


TATTTATTGT 


CAAATCCAAT 


TATCTTGTGT 


180 


TTTTCATAAA 


ACGATTGCTT 


TAAATAATCT 


TCTAACACAT 


CAAACATCGT 


ATTATCACCG 


240 


ACATGGTGCC 


CGTATAAAAT 


AGTATTATGA 


TTTAAATTCT 


NCAATTCATT 


TCTAAAATCC 


300 


ATAAAAATAC 


TACCTTTACG 


TCGATGTTCT 


CGCTCAAAAT 


CTAAATTTAA 


AT AATCG TG A 


360 


TTTGTCTTAC 


CTTGTAGTAC 


TGGATAANTT 


AATGATGTTC 


CTGATAATTT 


TATCCATCCA 


420 


ACAATGTCTT 


TATTTATTTT 


TTCAAGTGAT 


TCAAATTGTG 


GTCTCACATG 


TTCTTGATGT 


480 


TTGNTGATCA 


GCATNTGAAA 


TTTNTGTTGT 


AATTTCTCAT 


AANTTGCGCG 


TTCTTGCTTG 


540 


TCTTCAATAT 


ATGTTTGAAC 


AATTTTGTAA 


CCAAAAATGA 


TAATAATTAC 


AACCAATAAA 


600 


ATTTGTACAA 


TAGTTAAAAA 


TCGCTTCATT 


CTCATAAAAA 


CCCTCTTTAA 


TAAACAANTT 


660 


TCTTCAGTCA 


TCACTAAACC 


AGTTGTTGTA 


CCGTTTTAGA 


TTCGATTTCG 


TTGACTTTGA 


720 


CAAATTAAGT 


AAATTAGCAT 


TGGACCACCG 


ACAATCATTA 


AAATAGCATT 


GGCTGGAATT 


780 


TCTAAAGGAG 


GCTGTATCAC 


TCGTCCTAAT 


AAATCAGCCA 


CTAACAATAG 


CCATGCACCA 


840 


ATAACTGTAG 


AAAACGGAAT 


AAGTACTCTG 


TAATTGCCCC 


CAACTAGCTT 


TCTAACCACA 


900 


TGTGGCACAA 


TAATACCTAA 


AAAGGC TAG T 


TGTCCAACAA 


TCGCAACAGT 


TGCACTTGCT 


960 


AAAAATACTG 


CTAATAAACC 


TGTTAACCAT 


CTGTAACGAT 


CAATATTAAA 


ACCGATACTT 


1020 
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10 



CGCGCTTGTA TGTCGTCTAA ATTTAGTAAA TTCAATTTAG GGGACAATAG TAATGTTAAT 1080 

ATTAATCCCA ATAATGCTGA TACTGCTAAT ATGTATACGT CGCTC C AT AT TTTCATTGTT 1140 

AAGCCTTGAG GAATTTTCAT TAAAGGGTTT TGAGTTAAAA TTTCTAAAAC ACCATTTAAT 1200 

AATACGNATA ACGCAACACC TACTAATATC ATACTTACAG CATTGAATCT AAATTTAGAA 1260 

TGCAACAATA TAATTATTAA AAATGGTATT AAACCTCCAA TAAAACTTAA TAATGGTAAG 1320 

TAAAAGTACA ATTGTGGAAT AAACAACATA CCAAGTGCTC TCATTATAAG TGCACCTGAG 1380 

GAAACGCCAA TGATATTCGC CTCTGCCAAA GGATTTTGTA GTCCTGCTTG TAATAATGCT 1440 

C C AG AAACTG CTAACATTGC GCCANCCATC AATGCAANTA ATAGACGTGC CAATCCCAAA 1500 

TCAATGATTG AATCCACTGC TTCATTGCTA CCAGTTGTAA ATTTTGTAAA TAGGTCATTA 1560 

AATGACAATT TAATTGTACC GGTTACAAAC GAAATATAAG CAGTTGCGAT TAAAATGACT 1620 

AACAAACATA AAAATAGTAG TTTCTTTTTA TTTTTTATCA TCAG 1664 

75 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 
<D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 

30 

Val Leu His Xaa He Arg Glu He Leu Leu Gin Leu Val His His Ser 

1 5 10 15 

Asp Xaa Gly Xaa Gly He Gin Pro Pro Lys Phe Phe Gin Pro Gly Asp 
55 20 25 30 

Xaa Xaa Lys Val Thr He Asp Asn He Gly Thr Leu Thr Thr Tyr He 
35 40 45 

Xaa Lys 
40 5 0 

(2) INFORMATION FOR SEQ ID NO: 2 61: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
so (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 61: 
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50 



Val Pro He Lys Ala Val He Ala Gin Thr Met Thr Thr Leu Pro Arg 

15 10 15 

Val He Asn He Lys Arg Leu Thr Ser Gly Leu Asn Lys Pro Leu Ser 

20 25 30 

Phe Met Met Pro Lys Lys He Met Ala Xaa He Asn He He Thr Thr 

35 40 45 

Xaa Met Xaa 
50 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Protein 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Val lie Lys Phe Lys Asn Val Thr Lys Arg Tyr Gly Lys His Val Ala 
so 1 5 10 15 

Val Asp Asn He Ser Phe Asn He Asn Glu Gly Glu Phe Phe Val Leu 

20 25 30 

He Gly Pro Ser Gly Cys Gly Lys Thr Thr Thr Leu Lys Met He Asn 
35 35 40 45 

Arg Leu He His Leu Ser Glu Gly Tyr He Tyr Phe Lys Asp Lys Pro 

50 55 60 

He Ser Asp Tyr Pro Val Leu Arg Asn Ala Leu Gly Tyr Trp Xaa Arg 
40 65 70 75 80 

Xaa Gly Xaa Arg Leu Glu Leu Xaa Pro He 
85 90 

45 (2) INFORMATION FOR SEQ ID NO: 2 63: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: Protein 
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10 



20 



35 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Val Cys Cys phe Gly Ala Ser Thr Met Leu Ser His lie Thr Ala Tyr 

15 10 15 

Glu lie Leu Ser Glu lie Xaa Lys Lys Leu Ala Gin Lys Leu Met Arg 

20 25 30 

Leu Pro Leu Xaa Val Val Glu Ser Lys Lys lie Gly Glu Leu Lys Asn 

35 40 45 

lie Phe Xaa Asp Lys Val Glu Thr lie Glu Leu Pro Leu Ala His Met 

50 55 60 

He Pro Glu Val Xaa Gly Asn Leu Leu Val Ala Ala Ala He Phe Leu 
65 70 75 80 

Tyr He Met Xaa Xaa Asp Trp Arg He Gly 
85 90 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 

Val Thr Ser Phe He Tyr Lys He Leu Tyr Val Val Lys lie Asn Ala 

15 10 15 

Tyr Thr Tyr Asp He Met Thr Glu Asp He Met Xaa Leu Ser lie Leu 

20 25 30 

Leu He Phe Leu Cys He Arg Leu Val Ser Leu Lys lie Cys lie Xaa 

35 40 45 

His Ser Lys Gin Leu Lys Ala Asp Gly Ala Val Glu Tyr Gly Val Lys 

50 55 60 

Asn Ser Lys Phe Leu Ala lie Thr His Val Leu lie Tyr Val Leu Ala 
65 70 75 80 

Gly Val Glu Ala Phe lie Asn Lys Asp Thr Phe Ser Phe Ala Asn Gly 

85 90 95 

lie Gly Leu Val lie Leu lie Phe Ala Tyr lie Met Leu Phe Met Val 

100 105 110 

lie Lys Thr Leu Gly Gly lie Trp Thr Leu Lys Leu Phe lie Leu Pro 
115 120 125 
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.. Asn His Pro He He Lys Ser Gly Leu Tyr Lys He Thr Lys Thr Pro 
130 135 140 

Lys Leu Leu Leu Lys His His Ser 
145 150 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 104 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

Val Arg Val Thr Gly He Xaa Pro Tyr Gly Ala Phe Val Glu Thr Pro 

15 10 15 

Asn His Thr Glu Gly Leu He His He Ser Glu He Met Asp Asp Tyr 

20 25 30 

Val His Asn Leu Lys Lys Phe Leu Ser Glu Gly Gin He Val Lys Ala 

35 40 45 

Lys He Xaa Ser He Asp Asp Glu Gly Lys Leu Asn Leu Ser Leu Xaa 

50 55 60 

Asp Asn Asp Tyr Phe Lys Asn Tyr Glu Arg Lys Lys Glu Lys Gin Ser 
65 70 75 80 

Val Leu Asp Glu He Arg Xaa Thr Glu Lys Tyr Gly Xaa Gin Thr Leu 

85 90 95 

Lys Arg Thr Leu Thr Asn Leu Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 266; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 6: 



221 



EP 0 841 394 A2 



15 



Val Thr Xaa Gly Val Xaa Glu Leu He He Xaa Ser Gly Glu Glu Glu 

15 10 15 

Ser Ser Ser Gin Leu Cys His Pro His Ser Val Lys Asp Val Thr Gly 

20 25 30 

Ala Gly Asp Ser Phe Cys Ala Ala Val Val Tyr Ser Trp Xaa Asn Gly 

35 40 45 

Met Ser Thr Val Asp lie Leu He Ala Gly Met Val Asn Ala Xaa Lys 

50 55 60 

Thr He Glu Thr Lys Tyr Thr Val Arg His Asn Leu Asp Gin Gin Gin 
65 70 75 80 

Leu Tyr His Asp Met Glu Asp Tyr Lys Asn Gly Lys Phe Thr Gin Val 
85 90 95 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 2 67: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 56 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 67: 

35 val Asn Lys Asn Xaa Thr Thr Met Ser His He Thr Xaa Xaa Gin Ser 
15 10 15 

Ala He Leu Ala Ser Lys Val Asn Ala Pro Ser Val Tyr Asn Xaa Asn 
20 25 30 

40 Asn Met Ser Glu Asn Phe Thr Gin Arg Val Lys His Xaa Leu Arg Lys 
35 40 45 

Asn Glu Ala Thr Thr Xaa Tyr Gin 
50 55 



45 



(2) INFORMATION FOR SEQ ID NO: 2 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 
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(ii) MOLECULE TYPE: Protein 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:268: 

Val Ser Ser Arg Asp lie Gly Glu His Val Met Asn Leu Leu Met His 

15 10 15 

Val Asp Gin Val Ser Tyr Val Arg Phe Ala Ser Val Tyr Lys Glu Phe 

20 25 30 

Lys Asp Val Asp Gin Leu Leu Ala Ser Met Gin Gly lie Leu Ser Glu 

35 40 45 

Asn Lys Arg Ser Asp Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 2 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 69: 

Val Ala Xaa Val Xaa lie lie Xaa Phe Met Met Leu Xaa Pro Xaa Gly 

15 10 15 

Gly Phe He Ala Leu Glu Gin He Gly Glu Arg Met Asn He Ala Ala 

20 25 30 

He Glu Val Asp Lys Ser Tyr Arg Phe Asn Gly He Gly Ser Ser Ala 

35 40 45 

Leu Arg Gin Leu Pro Thr Tyr Leu Arg Lys Asn Tyr Asp Asn Leu Asn 

50 55 60 

Val He Thr Met He Leu Phe Gly Glu Asn Asn Asp Phe Lys Pro Leu 
65 70 75 80 

Cys Leu Asn Ser Asn Phe Val Glu He Glu Gin Thr Asp Asp Tyr Val 

85 90 95 

Val Phe Glu Lys Tyr Leu Asn Tyr 
100 

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 amino acids 
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.(B) TYPE: amino acid . v 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

Val Glu lie Lys Val His Ala Leu His Ser Gly lie Thr Leu lie Lys 

15 10 15 

Asp Lys Gly Lys lie lie Asp lie His Leu Ser Val Lys Ala Thr Glu 

20 25 30 

Asn lie Asp Gly Glu Val Leu Phe Lys Ala Thr Gin Pro Leu Gly Arg 

35 40 45 

Thr Met Lys Val Gly Val Gin Asn Asn Ala Met Thr lie Thr Leu Thr 

50 55 60 

Lys Gin Asn Gin Trp Leu Asp Ser Leu Lys Phe Leu Val Lys Cys He 
65 70 75 80 

Glu Glu Ser Met Arg He Ser Asp Glu Ala 
85 90 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

Val Glu Tyr Phe Val Ser Tyr Tyr Asp Tyr Tyr Gin Pro Glu Ala Tyr 

15 10 15 

Val Pro Ser Thr Asp Thr Phe He Glu Lys Asp Ala Ser He Asn Asp 

20 25 30 

Glu He Asp Xaa Xaa Arg His Ser Ala Thr Ser Ala Leu Phe Glu Arg 

35 40 45 

Asp Asp Val He He He Ala Ser Val Ser Cys He Tyr Gly Leu Gly 

50 55 60 

Asn Pro Glu Glu Tyr Lys Asp Leu Val Val Ser Val Arg Val Gly Met 
65 70 75 80 
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Glu Met Asp Arg Ser Glu Leu Leu Arg Lys . Leu Val Asp Val Gin Tyr. /t 

85 90 95 

Thr Arg Asn Asp lie Asp Phe Pro Thr Arg Asn Arg Phe Glu Cys Val 
100 105 110 

Val Met 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS r single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Val Gly His He Leu He Glu He Ser Xaa Asp Gly Pro Gly Phe Met 

1 5 10 15 

Glu Asp Xaa Pro Tyr Glu Thr Leu Gly Xaa Gly Leu Ser Leu Pro Pro 

20 25 30 

Phe Leu Glu Asn Lys Arg Glu Tyr He Glu Ser Glu lie Arg Pro Phe 

35 40 45 

Asn Thr Lys Arg 
50 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

Val Ser He Thr Leu Xaa Pro Ala Leu Ala Ala Thr Leu Phe Lys Lys 

15 10 15 

Gly Val Lys Arg Arg Asn Lys Gin His Gin Glu Gly Leu Gly Val Val 
20 25 30 
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Ser Thr Thr Tyr Lys Xaa Val Xaa His Trp Val Thr Lys Ser , 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

{CJ STRAND EDNESS : single 
{ D ) TOPOLOGY : 1 inea r 

(ii) MOLECULE TYPE: Protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274.: 

Val Val Gin Glu Thr Leu Asn Met Ser Lys Ala Arg Gly Tyr Glu Val 

15 10 15 

Gly Gly Thr Val Arg lie Val lie Asn Asn Gin Val Gly Phe Thr Thr 

20 25 30 

Ser Asn Pro Leu Asp Ala Arg Ser Tyr Ala Val Leu Tyr 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Val Glu Gly Asn Phe Gly Leu Ala lie Asn Asn His Gin Asn lie Val 

15 10 15 

Gly Thr lie Gly Leu lie Arg Leu Asp Asn Asn Met Ser Ala Leu Lys 

20 25 30 

Lys Met Phe Val Asp Lys Gly Tyr Arg Asn Leu Lys lie Gly Lys Lys 

35 40 45 

Leu Leu Asp Lys Val lie Met Thr Cys Lys Glu Gin Asn He Asp Xaa 

50 55 60 

Xaa Tyr Leu Gly Thr Xaa Asp Lys Xaa Xaa Ser Ala Gin Tyr Xaa Tyr 
65 70 75 80 
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Xaa Asn Xaa Gly Phe Arg Xaa Asn 
85 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

Val He Gly Leu Lys Ser Lys Thr He He Ser Asn Glu Trp Gly Ala 

15 10 15 

He Met He Gin Ser Met Trp Phe Asn Leu His Val Gin Asp Leu Glu 

20 25 30 

Lys Ser Ala Gin Phe Tyr Lys Ala Leu Gly Phe Lys He Asn Arg Asn 

35 40 45 

Pro Gin Met Leu Asp Lys Met Val Gly He Gin lie Gly Gin Thr Thr 

50 55 60 

Ala He Leu He Glu Asn Lys His Phe Gin Asn Val Ser Gin Gin Ser 

65 70 75 80 

Leu Asn Thr Glu Pro Asn Glu Val Met He Ser Leu Gly Val Thr Gin 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 2 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

Val Ser Leu Met Thr Thr Phe Ser Glu Lys Glu Lys He Gin Leu Leu 

15 10 15 

Ala Asp He val Glu Leu Gin Thr Glu Asn Asn Asn Glu He Asp Val 
20 25 30 
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Cys Asn Tyr Leu Lys Asp Leu Phe Asp Lys. Tyr Asp He Lys Ser Glu v . 

35 40 45 

lie Leu Lys Val Asn Glu His Arg Ala Asn He Val Ala Glu He Gly 

50 55 60 

Asn Xaa Ser Pro He Leu Ala Leu Ser Gly His Met Asp Val Val Gly 
65 70 75 80 

Cys Arg Lys Ser Arg 
85 



(2) INFORMATION FOR SEQ ID NO: 2 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 



Val Glu Xaa Met Asn Lys His Tyr 

1 5 
Thr Ala Gly Val Thr Val Ala Ser 
20 

Leu Lys Glu Lys He Ala He He 

35 40 
Gin Pro Leu Trp Thr Leu Val Gly 

50 55 
Xaa Ser 
65 



Xaa He Val He He Gly Gly Gly 

10 15 
Arg Leu Leu Arg Lys Asn Xaa Asn 
25 30 
Asp Pro Ala Asp His His Tyr Tyr 
45 

Ala Gly Val Ser Ser Leu Lys Ser 
60 



(2) INFORMATION FOR SEQ ID NO: 279: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 
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Val Leu Arg Asn Lys lie Lys Arg 

1 5 
His Lys Ser Xaa lie Leu Xaa Lys 
20 

Ala Ala Lys Asp Met Thr Thr Leu 

35 40 
Xaa Leu Lys lie Gly Lys Val Phe 
50 55 



Ala lie. Arg Glu Asn Phe Lys Val v . . 

10 15 
Asp lie lie Xaa lie Xaa Arg Gin 
25 30 
Gin lie Gin Xaa Ser Leu Glu His 
45 

Asn Lys Lys lie Lys 
60 



(2) INFORMATION FOR SEQ ID NO: 2 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 



Val Asp Gly Val Ser Leu Ala Val 

1 5 
Asp lie His Leu lie Pro Glu Thr 
20 

Thr Lys Leu Gly Asp Lys Val His 

35 40 
Tyr Val Glu Asn lie Leu Asn Lys 

50 55 
Lys Leu Arg Ala Phe Xaa Phe 
65 70 



Phe Asp Lys His Asp Asn Ser Phe 

10 15 
Arg Arg Ser Thr lie Leu Ser Ser 
25 30 
Leu Glu Thr Asp Val Leu Phe Lys 
45 

Asp Lys Asp Gin Leu Ser Val Asp 
60 



(2) INFORMATION FOR SEQ ID NO: 2 81: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 81: 
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Val Xaa Gin Ser Leu Tyr Glu Phe Leu Glu. Glu Asn He Asn Tyr Leu ... 

15 10 15 

Lys Glu Asn Gly Xaa Tyr Asn Glu He Asp Thr He Glu Gly Ala Asn 

20 25 30 

Gly Pro Glu He Lys He Asn Gly Lys Ser Tyr He Asn Leu Ser Ser 

35 40 45 

Xaa Asn Tyr Leu Gly Leu Ala Thr Asn Glu Asp Xaa Arg Ser Ala Ala 

50 55 60 

Lys Ala Ala He Asp Thr His Gly val Gly Ala Gly Ala Xaa Arg Thr 
65 70 75 80 

Xaa Asn Gly Thr Leu Asp Leu His Asp Glu Leu Glu Glu Thr Leu Ala 

85 90 95 

Lys Phe Xaa Gly Thr Glu Ala Ala He Ala Tyr Gin Xaa Arg He Xaa 

100 105 110 

Leu Xaa He Trp Leu Leu Xaa Gin Leu Val Xaa Glu Xaa Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 2 82: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 amino acids 
(S) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 



Val Leu Gly Gly Asn Val Thr Glu Thr His Thr Phe Lys Leu Pro Glu 

15 10 15 

Asp Ala Gly Glu Arg Gin Met Phe He He Asp Lys Lys Arg Gin Thr 

20 25 30 

Pro Lys Lys Tyr Pro Arg Lys Pro Gly Thr Pro Asn Lys Thr Pro Leu 

35 40 45 

Leu Glu Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 283: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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15 



XV) TOPOLOGY : linear 

— \ 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

Val Lys Glu Asn Asp Val Lys Lys Phe Lys Tyr Lys Asn Phe Xaa Asp 

1 5 10 15 

Glu lie Glu lie Asp Phe Thr Asp Ser Asn His Leu Ala Ala His Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 2 84: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) topology: linear 

(ii) MOLECULE TYPE: Protein 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

Val Thr Pro Asp Ala Asn Gly Lys Val Ala Phe Asp Gly Leu Glu Leu 

1 5 10 15 

Thr Phe Thr Gly Thr Pro Ala Val Asn Asp Ser Phe Thr Leu Lys Pro 

20 25 30 

Val Ser Asp Ala He Val Asn Met Asp Val Leu He Thr Asp Glu Ala 

35 40 45 

Lys He Ala Met Ala Ser Glu Glu Asp Ala Gly Asp Ser Asp Asn Arg 

50 55 60 

Asn Gly Gin Ala Leu Leu Asp Leu Ala Lys Gin Gin 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 2 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 (ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Val Leu Leu Pro Leu Val Phe lie Ser Val Leu lie Gly lie Phe Asn 

15 10 15 

Tyr lie Lys Val Leu Pro Phe He He Lys Tyr Val Gly He Ala He 

20 25 30 

Asn Lys He Thr Arg Met Gly Arg Leu Glu Ser Tyr Phe Ala He Ser 

35 40 45 

Thr Ala Met Phe Gly Gin Pro Glu Val Tyr Leu Thr He Lys Asp He 

50 55 60 

He Pro Arg Leu Ser Arg Ala Lys Leu 
65 70 



(2) INFORMATION FOR SEQ ID NO: 286: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 



Val He He Ser Asn Asn Val Gly Asp Met Ser He Gly Phe Ser Ser 

15 10 15 

Asp Asp Asn Glu Leu Thr Met His Phe Arg Asn Asn Glu Lys Val Asn 

20 25 30 

He Lys Lys Gly Lys Lys Val Val Leu Ala Ala Gin He Leu Asp Glu 

35 40 45 

Leu Glu Thr Arg Trp Gin 
50 



(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 87 : 

Val Thr Tyr Xaa His Met Ser His Arg His Thr Leu lie Ser His Ser 

1 5 10 15 

phe Lys Phe Xaa Arg Leu Pro Gin Asp lie Asn Phe Phe Ser Trp Xaa 

20 25 30 

Gin His Val Lys Asp Ser Asp Lys Thr Asp Glu Leu Thr Tyr Ser Glu 

35 40 45 

Ser Leu Xaa Pro Glu Gly His Pro Thr His Pro Leu Thr Pro Thr Lys 

50 55 60 

Leu Pro Xaa Thr Met Glu Glu Xaa Arg Ala Tyr Ala Pro Glu Phe Asp 
65 70 75 80 

Xaa Arg Asn Pro Phe Ala Thr Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 2 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 

Val lie Ser Leu Leu Lys Asp Ala Lys Leu Pro Ser Ser Val Thr Arg 

15 10 15 

Gly Asp Tyr Leu Ala He Leu Ser Xaa Gly Ala Tyr His Tyr Ser Met 

20 25 30 

Xaa Ser Asn Tyr Asn His Met His Lys Pro Ser Val Phe Phe Leu Lys 

35 40 45 

Asp Val Thr Ala Arg Val Val He Lys Arg Gin Ser Leu Arg Gin Leu 

50 55 60 

He He Asn Xaa Thr Gin 
65 70 

(2) INFORMATION FOR SEQ ID NO: 289: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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.(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 

Val Gin Phe Val Asn Lys Leu Phe Asn Val Ser Ser Ala He He Leu 

15 10 15 

Leu Glu Tyr Asp Gly Val Val His He Gly Tyr Asp Asn Asn Phe Glu 

20 25 30 

Phe Lys Thr Glu Gin Phe Lys Met Ser Lys Ser Arg Asn Leu Leu Lys 

35 40 45 

Asn Arg Ser Gin Asn Xaa Val Leu He Arg Leu Leu Asn Trp Leu Arg 

50 55 60 

Thr Thr Xaa 
65 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Val Gin Leu Gly Leu Thr Asn Ser Val Val Ala He lie He lie Xaa 

15 10 15 

Arg Glu Xaa Ala Val Thr Gly Leu Arg Leu Leu Gin He Glu Gin Gly 

20 25 30 

Xaa Gly Lys Cys Ser Trp Xaa lie Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



Val Asp Val Asp Asn Met Ser Asp 

1 5 
Thr Xaa Ser Asp lie Thr Gly Tyr 
20 

Ala Gin Tyr Val lie Ser Gin Leu 

35 40 
Asn Leu Thr Leu Asn Xaa Xaa Glu 

50 55 
Val Leu 
65 



Tyr Lys Leu Lys lie lie Xaa Leu 

10 15 
Gin He Pro Asn Gin Xaa Gly Val 
25 30 
Ser Gin Gly Lys Arg Glu Val Asp 
45 

Xaa Leu Tyr Ser Tyr Xaa Arg Gin 
60 



(2) INFORMATION FOR SEQ ID NO:292: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 



Val Lys Gin Thr Xaa Val Glu Glu He Xaa Ala Ser He Asn Xaa Ala 

15 10 15 

Gly Leu Xaa Xaa Glu He Pro Asp Phe Lys Xaa Glu Val Xaa Xaa Xaa 

20 25 30 

Xaa Lys Lys Trp Arg Leu Cys Tyr 
35 40 



(2) INFORMATION FOR SEQ ID NO:293: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:293: 



Val Ser Thr Asp Phe He Leu Cys Lys Leu Gin Ala Phe His He He 

15 10 15 

Lys Phe Glu Lys Arg Tyr He Glu Val Glu Lys Asn Glu Tyr Thr Ala 

20 25 30 

Lys Tyr Asn Glu Tyr Ser Gin Leu Leu Asp Ala Thr Tyr Ser Gin Ala 

35 40 45 

Val Ala Tyr Leu Leu Asn Lys Tyr Gly Ala Val Thr Asp Asp Tyr Tyr 

50 55 60 

Lys Glu Lys Ser Tyr Thr Arg Phe Leu Asn Gly Glu He Lys Ser He 
65 70 75 80 

Ser Lys Gly Lys Tyr Thr Arg Ala Ser Glu Gly Leu Tyr Cys His His 

85 90 95 

He Ser Glu Asp Lys Phe Gin Asn Leu Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 294: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 



Val He Leu Ser Ser He Xaa Phe 

1 5 
Pro Met Ala He Gly Val Trp Ala 
20 

Thr Val He Asp Trp Asp He Ser 

35 40 
Met Gin Ala Thr Lys Ala Arg Ala 

50 55 
Lys Leu Leu He Xaa He Asp Asp 
65 70 
Asp Xaa Xaa Glu Ser Arg 
85 



Tyr Met Gin Leu Leu Gin Lys Asn 

10 15 
Gly Leu Cys Gin Met He Lys Pro 
25 30 
Glu Tyr Thr Pro Lys Pro Ala Trp 
45 

Tyr Val Pro Arg He Leu Val Glu 
60 

Met Leu Glu Gly He Glu He Tyr 
75 80 



(2) INFORMATION FOR SEQ ID NO: 295: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



Val Xaa Thr Tyr Val Asn Asp Tyr 

1 5 
Lys Tip Ser Glu Met Met Leu Trp 
20 

Thr Val Gin Leu lie Asn Lys lie 

35 40 
lie Ser Lys Gly Glu Leu Gly He 

50 55 
Lys Lys Ala Met Ser Asn Thr Ala 
65 70 
Gly Leu 



Leu Asn Glu Ala His He Thr Asn 

10 15 
Ser Ser Gin Gin Arg Ser Gin Tyr 
25 30 
Glu Thr Asp Asp Ser Tyr He His 
45 

Ser Leu Met Pro His He Gin Leu 
60 

Ser His He Glu Asp Leu Ser Cys 
75 80 



(2) INFORMATION FOR SEQ ID NO: 29 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 96: 



Val Cys Lys Lys Tyr Val Leu Gly Gin Phe Lys Pro Gly Met Thr Ala 

15 10 15 

Pro Gin Cys Ala Gly He He His Thr Asp Phe Xaa Arg Gly Phe He 

20 25 30 

Arg Ala Gly Ser Asn Lys Leu Leu Met Thr Met Tyr Asn Met Ala Ala 

35 40 45 

Lys Val Ala Leu Lys Lys Arg Ala Asp Thr Asp 
50 55 
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. (2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 



Val Leu Thr Leu Ala Glu Met Lys 

1 5 
Phe Arg Asp Glu Asp lie Trp Xaa 
20 

Ser Pro Xaa Leu Xaa Asp Asp Xaa 

35 40 
Xaa Asn Ala Tyr Asn Xaa Tyr Xaa 
50 55 



Arg Thr lie His Asp lie Leu Asp 

10 15 
Cys Tyr Leu Gly Thr Leu Ala Val 
25 30 
Gly Xaa Xaa Leu Leu Ser Arg Xaa 
45 



(2) INFORMATION FOR SEQ ID NO: 298: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 

Val Phe Asn Tyr Gly Gin Ser Val Phe Glu Gly Leu Lys Ala Tyr Lys 

15 10 15 

Arg Asp Gly Glu Gly Cys Thr Xaa Pro Xaa Pro Glu Glu Asn Phe Lys 

20 25 30 

Arg Leu Asn Asn Ser Leu Ala Arg Leu Glu Met Pro Gin Val Asp Asp 

35 40 45 

Ala Glu Leu Leu Glu Gly Leu Lys Xaa Leu Val Asp lie Glu Arg Asp 

50 55 60 

Xaa Xaa Pro Glu Gly Glu Gly Gin Ser Leu Tyr Xaa Xaa Pro Phe Gly 
65 70 75 80 
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Xaa Ala Thr Glu Gly Ala Xaa Gly Val Gly Ala Ser His Gin Tyr Xaa v . 

85 90 95 

lie Tyr 



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 



Val Arg Phe lie Leu Glu Asp Pro Pro Arg Asp Leu Asp He Tyr He 

15 10 15 

Asp He Ala Glu Arg Ala Gly Gin His Pro Xaa Val Glu Arg He Xaa 

20 25 30 

Asp Gly Xaa Gin Arg Arg He Asp He Xaa Thr Val Glu Ser Cys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 3 00: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



Val Val Lys Thr Arg He Leu Lys 

1 5 
Pro Leu Lys Pro Thr Gly Ser His 
20 

Leu Arg Gin Val Pro Tyr Gly Thr 

35 40 
Thr Lys Val Gly Lys Val Leu Asp 
50 55 



Val Asp Asn Pro Glu He Thr He 

10 15 
Phe Gin Gin Cys Val Trp Asn Glu 
25 30 
Leu Thr Thr Tyr Gly Ala He Ala 
45 

Lys Pro Gin Met Ser Ala Gin Ala 
60 
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Val Gly Gly Ala Val Gly Ser Asn Pro Leu . Ser lie lie Val Pro Cys ... 
65 70 75 80 

His Arg Val Val Gly Lys Thr Gly Ser Leu Thr Gly Phe Gly Gly Thr 

85 90 95 

lie Asn Asn Lys lie Lys Leu Leu Glu Leu Glu Asn lie Asp Met Ser 

100 105 110 

Lys Leu Tyr lie Pro Lys His Ser Thr Lys Pro 
115 120 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

Val Cys Leu Val Glu Cys Met Met Ala Leu Asn Thr Ala Gly Xaa Thr 

15 10 15 

Gin Arg Thr Val Glu Thr Leu Ala Glu Tyr Ser Gly Val Pro val Trp 

20 25 30 

Asn Gly Leu Thr Asp Glu Asp His Pro Thr Gin Xaa Leu Ala Asp Phe 

35 40 45 

Leu Thr Ala Lys Glu Val Phe Lys Lys Arg Leu Cys Arg Tyr 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Val Asn Leu Pro Asp Phe Pro Pro Lys lie Gly Val Asn Lys Ser Thr 
15 10 15 
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. Leu Ser Arg Tyr Xaa Xaa Gly Ser 
20 

Ala Glu lie Ala Asn Ala Leu Lys 

35 40 
Xaa Asn Arg Gin Pro Glu Xaa Glu 

50 55 
Glu Gly Glu Leu Thr Asp Asp Glu 
65 70 
Asp Tyr lie Arg Ser Lys Arg Lys 
85 



Arg Xaa. He Pro Met Glu Asp He-. 
25 30 
Val Thr Pro Glu Tyr Leu Leu Leu 
45 

Val Gin His Arg Ala Ala His Leu 
60 

Cys Gin Arg Val Leu Asp Tyr Ala 
75 80 



(2) INFORMATION FOR SEQ ID NO: 303: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



val Lys Pro Xaa Cys Ala Lys Lys Asp Arg Tyr Pro He Asp Pro Met 

15 10 15 

Arg Arg Leu Asn Gin Glu He He Asp Tyr He Asp Thr Leu Asn Tyr 

20 25 30 

He Asp Gin Tyr Ser Ser Ala Gly Ser Phe Arg Arg Phe Lys Glu Met 

35 40 45 

Ser Lys Asp Leu Asp Phe He He Ser Thr Asp Asn Pro Gin Ala Val 

50 55 60 

Gin Gin Gin Leu Leu Asn He Pro Asn Lys Val Lys Glu Val Ala Val 
65 70 75 80 

Gly Asn Thr Gin Val Ser Leu Glu Leu Ala Tyr Asp Asp Glu Thr He 

85 90 95 

Gly Val Asp Phe Arg Leu He Glu Pro Arg Ala Phe Tyr His Thr Leu 

100 105 110 

Gin His Phe Thr Gly Ser Asn Arg Thr 
115 120 



(2) INFORMATION FOR SEQ ID NO: 304: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 116 amino acids 
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.(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 



Val Phe Asp lie Tyr Ser Glu Xaa Asp Leu He Glu He Thr Asp Tyr 

15 10 15 

Pro Xaa Ser He Ser Phe Asn Phe Ala Ser Tyr Met Ser Ala Ser Lys 

20 25 30 

Phe Phe Lys Asp Tyr Ala Leu Xaa Thr Asn Asp Lys Ser Gin Tyr Leu 

35 40 45 

Glu Asp Tyr Asn Gin His Val Ala He Val Ala Leu Tyr Leu Ala Xaa 

50 55 60 

Gly Asn Lys Ala Gin Ala Lys Gin Phe He Ser Ala Met Val Glu Gin 
65 70 75 80 

Arg Tyr Xaa Pro Ala Thr Pro Thr Phe Leu Asn Ala Xaa Arg Ala Arg 

85 90 95 

Arg Gly Glu Leu Val Ser Cys Phe Xaa Xaa Lys Lys Trp Val Xaa Xaa 

100 105 110 

Leu Asn Ser Xaa 
115 



(2) INFORMATION FOR SEQ ID NO: 305: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 05: 

Val Met Glu Val Thr Cys Leu Lys Gin He Thr Trp His Asp Leu Gin 

15 10 15 

His He He Lys Asp Gly Asp Val He Gly Leu Pro Ala Leu Ala Val 

20 25 30 

Ala Asn Leu Pro Ala Glu Val Leu Arg Ala Val Leu Ala Gin His Asp 
35 40 45 
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Thr Tyr His Thr Pro Lys Asp Leu Thr Phe. He Leu Ala Asn Asp Ile v 

50 55 60 

His Ser Leu Gly Ala Ala Pro Asp Leu Asp Asp Phe He Glu Arg Arg 
65 70 75 80 

Met He Lys Arg Val He Met Ser He Leu Thr Ala Ser Ser Lys Thr 

85 90 95 

Ala Gin Ala Met Lys Asn Asn Asp He Glu Ala Tyr Phe Leu Pro Gin 

100 105 110 

Gly He He Ala Thr His Tyr Arg Gin Ser Asn Gin Leu Leu Pro Gly 

115 120 125 

Val He Thr Lys He Gly Leu Asn Thr Ala Val Asp Pro Arg Tyr Gly 

130 135 140 

Gly Gly Lys Val Asn Thr Arg Thr Thr Asp Asp Leu Val Ser Leu Val 
145 150 155 160 

Thr He Asn Asp Glu Thr Tyr Leu His Tyr Thr Phe Pro Ser Val Asp 

165 170 175 

Val Ala Leu Leu Arg Gly Thr Tyr Ala Asp Gin Gin Gly Asn He Tyr 

180 185 190 

Leu Xaa Gin Glu Ala Tyr Leu Ser Glu Cys Tyr His Val Ala Leu Asn 

195 200 205 

Ala Lys Ala Asn His Gly Lys Val He Xaa Gin Val Lys Ala Leu Val 

210 215 220 

Asp Asp Tyr His Leu Lys Pro Asn Glu Val Val He Pro Gly Asn Leu 
225 230 235 240 

Val Asp Tyr Val Phe Val Thr Glu Asp Glu Lys Asn His Arg Gin Val 

245 250 255 

He Gin Ser His Tyr Leu Pro Ala Leu Ser 
260 265 

(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 amino acids 
(B} TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306: 

Val Val Lys He Val Xaa His Gin Lys Asp Ser Asn Xaa Glu Lys Asp 
15 10 15 
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Asp Lys Ala Arg Thr Xaa Gin Gin Asp Xaa Lys Xaa Xaa Asp Ser Ser.. 

20 25 30 

Xaa Asp Lys Lys Asp Asn Xaa Asp Asp Ser Xaa Asp Val Xaa Lys Asp 

35 40 45 

Asn Lys Asp Asn Ser Ala Asn Asp Asn Gin Gin Gin Ser Asn Ser Lys 

50 55 60 

Cys Asn Lys Gin 
65 

(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

Val Arg Arg Phe Pro Asp lie Glu Arg Arg Val Gin Thr His Asp Phe 

1 5 10 15 

Thr Trp Gin Ala Glu lie Met Ser Lys Pro Val Lys Asn Met Thr Val 

20 25 30 

Ala He He Gly Thr Gly Arg He Gly Ala Ala Thr Ala Lys He Tyr 

35 40 45 

Ala Gly Phe Gly Ala Thr He Thr Ala Tyr Asp Ala Tyr Pro Asn Lys 

50 55 60 

Asp Leu Asp Phe Leu Thr Tyr Lys Asp Ser Val Lys Glu Ala He Lys 
65 70 75 80 

Asp Ala Asp He He Ser Leu His Val Pro Ala Asn Lys Glu Ser Tyr 

85 90 95 

His Leu Phe Asp Lys Ala Met Phe Asp His Val Lys Lys Gly Ala He 

100 105 110 

Leu Val Asn Ala Ala Arg Gly Ala Val He Asn Thr Pro Asp Leu Xaa 

115 120 125 

Ala Ala Val Asn Asp Gly Thr Leu Leu Gly Ala Ala He Asp Thr Tyr 

130 135 140 

Glu Asn Glu Ala Ala Tyr Phe Thr Asn Asp Trp Thr Asn Lys Asp He 
145 150 155 160 

Asp Asp Lys Thr Leu Leu Glu Leu He Gly Thr 
165 170 
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. (2) INFORMATION FOR SEQ ID NO: 308: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 



Val Val Val Leu Met Cys Cys Leu 

1 5 
Glu Val Arg Ala Gin Leu Pro Asp 
20 

Asp Gly Asn Arg Glu Ser Ser Cys 

35 40 
Ala Gly Glu Ala 
50 



Lys Tyr Glu Asn Asp Tyr Tyr Glu 

10 15 
lie Gly Glu Ala He Glu Thr Pro 
25 30 
Phe Lys Tyr He Arg His Phe Tyr 
45 



(2) INFORMATION FOR SEQ ID NO: 309: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 09: 



Val Asn Thr Xaa Leu Leu Asn Xaa 

1 5 
He Pro Val Ser Val Asp Glu Asn 
20 

Asn Thr Leu Phe Thr He Leu Cys 

35 40 
Ser Asp Pro Val Val He Gly He 

50 55 
He Leu Xaa Val Ser Ala Glu Thr 
65 70 



Arg Val Glu Pro Ala He Val Lys 

10 15 
He Asp Asn Val Glu Lys Lys Leu 
25 30 
Val Val Asn Xaa Xaa Leu Phe Val 
45 

Asp Ala Phe Glu Asp Thr Arg Val 
60 

He Pro Gly Xaa Gly Phe Ser Gly 
75 80 
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Ala Arg He lie Arg Lys Glu Val His Lys. Met Phe Leu Gin Glu Gly Vi 

85 90 95 

He Lys Thr Pro Xaa Pro He Met Thr Pro Phe Asn His Ser Glu Xaa 
100 105 110 

Gly Val 



(2) INFORMATION FOR SEQ ID NO: 3 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

Val Thr Gin Leu Leu Lys Asn Ala Xaa Lys Lys Xaa Asn Asp Leu He 

15 10 15 

Lys Glu Glu Ala Glu He Val Lys Asn Asp Asp Leu Pro Xaa Xaa Leu 

20 25 30 

lie He Xaa Gly Xaa Gin Tyr Asp Xaa Arg Trp Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:311: 



Val Arg Phe His Asp Gly Xaa Thr 

1 5 
Asn He Asp Ala Xaa Gin Gin Asn 
20 

He Ser Thr Leu He Asp Asn Val 
35 40 



Phe Asp Ala Asp Ala Val Lys Lys 

10 15 
Lys Lys Leu His Ser Trp Leu Lys 
25 30 
Lys Val Lys Asp Lys Tyr Thr Val 
45 
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Glu Leu Asn Leu Lys Glu Ala Tyr 

50 55 
Met Pro Arg Pro Tyr Val Phe Val 
65 70 
Thr Thr Lys Asp Gly Val Lys Lys 
85 

Leu Gly Glu His Lys Lys Asp Glu 
100 

Gin Tyr Trp Gly Glu Lys Ser 
115 



Gin Pro. Ala Leu Ala Glu Leu Ala.... 
60 

Ser Pro Lys Asp Phe Lys Asn Gly 

75 80 
Phe Asp Gly Thr Gly Pro Phe Lys 

90 95 
Ser Ala Asp Phe Asn Lys Asn Asp 
105 110 



(2) INFORMATION FOR SEQ ID NO:312: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3l2: 



val Pro He Asp Tyr He Xaa Gin Thr Gly Asp lie val Glu He Arg 

15 10 15 

Thr Ser Lys His Ser Tyr Gly Pro Ser Arg Asp Trp Leu Xaa He Val 

20 25 30 

Lys Ser Ser Ser Ala Lys Gly Lys He Lys Ser Phe Phe Xaa Xaa Gin 

35 40 45 

Asp Arg Ser Ser Asn He Glu Lys Ala Arg Met Met Val Glu Val Glu 

50 55 60 

He Lys Asp Gin Gly Phe Arg Val Glu Asp He Leu Thr Glu Lys Asn 
65 70 75 80 

He Gin Val Val Asn Xaa Lys Tyr Asn Phe Xaa Asn Glu Asp Asp Leu 

85 90 95 

Phe Ala Ala Val Gly Phe Gly Gly Val Thr Ser Leu Gin lie Val Asn 

100 105 110 

Lys Leu Thr Glu Arg Gin Arg He Leu Asp Lys Gin Arg Ala Leu Asn 

115 120 125 

Glu Ala Gin Glu Val Thr Lys Ser Leu Pro He Lys Asp Asn lie lie 

130 135 140 

Thr Asp Ser Gly Val Tyr Val Glu Gly Leu Glu Asn Val Leu He Lys 
145 150 155 160 



247 



EP 0 841 394 A2 



Leu Xaa Lys Cys Cys 
165 

lie Thr Lys Gly His 
180 

lie Lys Asn Glu Thr 
195 

Lys Asp Ala Thr Gin 
210 

Asp Arg Asn Gly Leu 
225 

Xaa Gly Asn Leu lie 
245 

Xaa lie lie Asn He 
260 

Arg Val Val Glu Lys 
275 

Lys Ser Leu Gly Thr 

290 
Lys Arg Gly 
305 



Asn Pro He Pro Gly Asp 
170 

Gly Thr Lys Val His Arg 
185 

Glu Arg Leu He Asn Val 
200 

Lys Tyr Gin Val Asp Leu 
215 

Leu Asn Glu Val Leu Gin 
230 235 
Lys Val Ser Gly Arg Ser 
250 

Ser Val Met Val Lys Asn 
265 

Thr Lys Gin Leu Gly Asp 
280 

Arg Gly Ser Lys Asn He 
295 



Asp He Val Gly Tyr,.. 

175 

Thr Asp Cys Pro Asn 
190 

Glu Trp Val Lys Xaa 
205 

Glu Val Thr Ala Tyr 
220 

Ala Val Ser Ser Thr 
240 

Asp He Asp Xaa Asn 
255 

Val Asn Asp Val Tyr 
270 

Val Tyr Thr Gly Asn 
285 

Glu Ser Arg Trp Tyr 
300 



(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 



Val Ser Thr Val His Pro He Arg 

1 5 
Thr Val Asp He Val He Gly Arg 
20 

Xaa Val He Leu Asp Asn Gly Lys 

35 40 
He Ala Arg Leu Gly Tyr Ser Asp 

50 55 
Glu Met Lys Ala Pro Xaa Ala Ser 
65 70 



He Pro Pro Gly Asp Pro Val Ser 

10 15 
Val Ala Gin Val His He Ala Asp 
25 30 
Leu Asp He Lys Ser He Lys Pro 
45 

Tyr Thr Val Val Asn Glu He Phe 
60 

Lys Glu Glu Leu Ala Gly Leu Glu 
75 80 
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Gly Arg Asn Phe Asp Asn Gin Ser Asp Glu. Lys Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO: 314: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:314: 

Val Leu Thr Arg Leu Lys Thr Met Gly Lys Leu Ala Met Leu Met Leu 

1 5 10 15 

Lys Asp Leu Asp Gin Ala Phe Lys Lys Lys Asp Thr Val Leu lie Arg 

20 25 30 

Glu lie lie Glu Arg Asp Glu Asp lie Asp Asp Leu Tyr Ser His lie 

35 40 45 

lie Asn Ala Thr Tyr Leu lie Asp Asn Asp Pro Phe Val Ala Ala Gin 

50 55 60 

Ala His Leu Ala Ala Arg His Leu Glu Arg He Gly Asp His He He 
6 5 70 75 80 

Asn He Ala Glu Ser Val Tyr Phe Tyr Leu Thr Gly Thr His Tyr Glu 
85 90 95 

Gin He Thr 



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

Val Thr Asn Lys He Leu Ser Gin Phe Leu Asn He Lys Pro Pro Ser 
1 5 10 15 
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Val Ser Glu Met Val Gly Arg Leu Glu Lys Ala Gly Tyr Val Glu Thr,^ 

20 25 30 

Lys Pro Tyr Lys Gly Val Arg Leu Thr Glu Asp Gly Leu Thr His Thr 

35 40 45 

Leu Asp lie lie Lys Arg His Arg Leu Leu Glu Leu Phe Leu lie Glu 

50 55 60 

lie Leu Lys Tyr Asn Trp Glu Glu Val His Gin Glu Ala Glu lie Leu 
65 70 75 80 

Glu His Arg lie Ser Asp Leu Phe Val Glu Arg Leu Asp Ser Cys 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 316; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 

Val Ser Gly Asn Xaa Xaa Gin Ala Asp Leu lie Gly Tyr Ser Phe Lys 

1 5 10 15 

Phe Asp Gly Ala lie Xaa Arg Gin Glu Ala Ser Xaa Asp Val His Ala 

20 25 30 

Val lie Leu Ser Asn Lys Thr Leu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 3 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 



Val Val Ser Met Lys Tyr Asn Thr Asn Val Lys His Thr Thr Leu Glu 
1 5 10 15 
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Ala Phe Val Thr Thr Val Asn Asp 
20 

Glu Ala Leu Arg Glu Val Arg Lys 

35 40 
Asp Ala Leu Val Asn Lys Asp Glu 

50 55 
Glu Tyr Lys Asn Leu Glu Ala Phe 
65 70 



Leu Gly lie Glu Leu He He Asn... 
25 30 
Arg Gin Leu He Glu Leu He Asp 
45 

Ala Ala Phe Asn Gin Tyr Thr Ala 
60 

Leu Gly Xaa 
75 



(2) INFORMATION FOR SEQ ID NO: 318: , 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



Val Tyr Lys Val Lys Val Ala Val 

1 5 
Lys He Met Gin Glu Ser Cys Asn 
20 

Tyr Glu Lys Gin Val Val Ser Ala 

35 40 
Gin Phe Ala Ser Glu Ala Arg Glu 

50 55 
Gly Ala Gly Gly Ala Xaa His Leu 
65 70 
Thr Leu Pro Val lie Gly Val Pro 
85 

Tyr 



He Met Gly Ser Ser Ser Asp Trp 

10 15 
Met Leu Asp Tyr Leu Glu He Pro 
25 30 
His Arg Thr Pro Lys Met Met Val 
45 

Arg Gly He Asn He He He Ala 
60 

Pro Gly Met Val Ala Ser Leu Thr 

75 80 
He Glu Thr Xaa Lys Phe Lys Gly 
90 95 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 319: 

Val Lys Lys lie Asp Cys Ser Xaa lie Gly lie Cys Gly Tyr Glu Tyr 

1 5 10 15 

Arg Gin Leu Lys Gin Glu Thr Xaa Leu Xaa Leu Phe Xaa Lys Leu His 

20 25 30 

Pro Ser Phe Xaa Gly Gin Gly Val Arg Met 
35 40 

(2) INFORMATION FOR SEQ ID NO: 32 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 0: 

Val Leu Gly Leu lie Glu Gly Lys Ser Gin Arg Lys Ala Tyr He Asp 

15 10 15 

Ala Gly Tyr Ser Thr Lys Gly Lys Ser Asp Asn Tyr He Asp Ser Arg 

20 25 30 

Ala Phe Glu Leu Ser Lys Asn Ser Ala Gly Leu Asp Arg Tyr Glu Glu 

35 40 45 

Leu Arg Gin Glu Ala Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 321: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 321: 
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10 



Val Lys Asp Leu Leu Gin Ala Gin Gin Lys. JLeu lie Pro Asp Leu Ile v 

15 10 15 

Asp Lys Met Tyr Lys Arg Phe Ser lie Leu Thr Thr lie Ser Lys Asn 

20 25 30 

Gin Pro Val Gly Arg Arg Ser Leu Ser Glu His Met Asp Met Thr Glu 

35 40 45 

Arg Val Leu Arg Ser Glu Thr Asp Met Leu Lys Lys Gin Asp Val lie 

50 55 60 

Lys Val Lys Pro Thr Gly Met Glu lie Thr Ala Glu Gly Glu Gin Leu 
65 70 75 80 

lie Ser Gin Leu Lys Gly Tyr Xaa Asp lie Tyr Gly Asp Asp Asn Arg 

85 90 95 

Leu Val Lys Lys Gly lie 
100 

(2) INFORMATION FOR SEQ ID NO: 322: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Protein 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Val Met He Met Arg Lys Glu He Glu Ala Leu He Phe Ser Asp Val 

15 10 15 

Ser Ser Tyr Asp He Tyr Val Asn Thr Gly Val Asn Gin Gly Leu He 

20 v 25 30 

Gly Asp He Lys Asp Gly Tyr Leu Thr He Asp Ser Met Pro Tyr He 

35 40 45 

Asp Ala Glu Arg Leu Tyr His Phe Ala Tyr Gly Thr 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 323: 



so (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 <D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Val Thr Cys He Val Leu Val Lys Val Val Glu Asn Ala Glu He Lys 

15 10 15 

Gly He Glu He Phe Asn Ser Val Asp Asn Asn Asn Lys Lys He Phe 

20 25 30 

Leu Lys Asp Gly Asn Val Val Gly Ala Val Ser Cys Gin Tyr Gly Asp 

35 40 45 

He Asp Asp Gly Ser Arg Phe Tyr Asn Met Met Lys Lys Gly Glu Ser 

50 55 60 

Thr Glu Asp Tyr Thr Leu Val Ser Leu Leu Thr Lys Gly Gly Glu Glu 
65 70 75 80 

Ala Ser Leu Ser He Ala Asp Met Ala Asp Asp Glu Thr He Cys Gly 

85 90 95 

Cys Asn Gly Val Asp Lys Gly Thr He Val Asn Ala He Thr Glu Asn 

100 105 110 

Gly Phe Thr Thr Val Glu Glu Val Thr Ala Lys Thr Lys Ala Gly Asn 

115 120 125 

Ser Cys Gly Lys Cys Lys Pro Gin He Ala Gin He Leu Gin His Thr 

130 135 140 

Leu Gly Asp Asp Phe Val Ala Ala Lys Pro Ala Gly He Cys Gly Cys 
145 150 155 160 

Thr Asp Leu Thr Arg Asp Gin He Val Thr Gin lie Arg Ala Lys Gly 

165 170 175 

Leu Lys Thr Ser Lys Glu Val Arg His Val Leu Asn Phe Lys Asn Lys 

180 185 190 

Gly Gly Cys Pro Lys Cys Arg Pro Ala He Asn Tyr Tyr Leu Asn Met 

195 200 205 

Val Tyr Pro His Asp His Glu Asp Glu Arg Glu Ser Arg Phe Ala Asn 

210 215 220 

Glu Arg Tyr His Ala Asn He Gin Asn Asp Gly Thr Phe Ser Val He 
225 230 235 240 

Pro Gin Met Arg Gly Gly Val Thr Asp Ala Asp Gin Leu He Arg Leu 

245 250 255 

Gly Glu Val Ala Lys Lys Tyr His Val Pro Leu Val Lys Val Thr Gly 

260 265 270 

Ser Gin Arg Val Gly Leu Tyr Gly Gly 
275 280 

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 
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-(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 324: 



Val Pro Met Thr Pro Leu Val Glu 

1 5 
Asn Asn Lys Ser Asn Ser Asn Ser 
20 

Leu Glu Asn Arg Gly Trp Asp Phe 

35 40 
Phe Asp Leu Xaa Asn Val Phe Trp 

50 55 
Leu Lys Gly Phe Trp Arg 
65 70 



Pro Xaa Pro Xaa Thr Val Ser Ser 

10 15 
Ser Thr Leu Xaa Tyr Leu Arg Thr 
25 30 
Asp Gly Ser Tyx Gly Trp Gin Cys 
45 

Xaa Ser Ser Ser Met Gly Met Asp 
60 



(2) INFORMATION FOR SEQ ID NO: 325: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325: 



Val Leu Gin Leu lie Thr Gin lie 

1 5 
Trp Val Asn Gin Leu Val Ser Xaa 
20 

Leu Arg Thr Xaa Ser Asp Trp lie 

35 40 
Asn Phe lie Lys Asp Pro Tyr Ser 

50 55 
Leu Tyr His Thr Ala Tyr Tyr Met 
65 70 
Met Lys Met Leu Asn His Ala Met 
85 



Tyr Gly Ala Glu Lys Arg Val Xaa 

10 15 
Ser Phe Asn Lys Lys lie Arg Pro 
25 30 
Ser Ser Asn Pro lie Glu Val Asp 
45 

Gly Phe Asn Val Ser Xaa Gin Leu 
60 

Leu His Thr Ser Gin Leu Lys Asn 

75 80 
Pro lie Leu Leu Val Ser Gly Tyr 
90 95 
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Ala Asp Ser Leu Gly Asp Tyr Gly Lys Gly lie Leu Lys Leu Ala Asn,,, 

100 105 110 

He Tyr Arg Glu Ala Gly lie Lys Asn Cys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

Val Thr Leu Glu Val Ala Glu Met Xaa Asn Ser Xaa Lys Asp Xaa Lys 

15 10 15 

Lys Phe Glu He He Asp Arg Xaa Lys Ser Phe Tyr Asp Glu Glu Gin 

20 25 30 

Xaa He Asp Leu Val Phe Val Val Asn Gin He Asn Gly Trp Asn Arg 

35 40 45 

Leu Asn He He Ser Asp Arg Leu 
50 55 

(2) INFORMATION FOR SEQ ID NO;327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

Val Asp Ser Val Thr He Lys Val Leu Asn Glu Pro Ser Pro Lys Leu 

15 10 15 

Leu Thr Thr Trp Tyr Ala Glu Gin Val Thr Gin Gly Lys He Lys Thr 

20 25 30 

Ser Lys Tyr Val Lys Lys Glu Cys Glu Arg His Leu Arg Tyr Leu Glu 
35 40 45 
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Asn Gly Gly Lys Trp Val Phe Asp Glu Glu. Leu Ala His Arg Pro I lew. 

50 55 60 

Arg Phe lie Glu Lys Phe Cys Lys Xaa Ser Lys Gly Ser Lys Arg Gin 
65 70 75 80 

Leu Ala Leu Gin Pro Trp Gin His Phe lie lie Gly Ser Leu Phe Gly 

85 90 95 

Trp Val His Lys Glu Thr Lys Leu Arg Arg Phe Lys Glu Ala Leu lie 

100 105 110 

Phe Met Gly Ala Lys Lys Trp 
115 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 8: 

Val He Xaa Xaa Val He Lys Glu Lys Pro Xaa Ala Arg Trp Leu Phe 

15 10 15 

Ser Pro Xaa Ser Pro Xaa Xaa Ala He Asp Gly Asp Thr Leu Val Thr 

20 25 30 

Lys Xaa Xaa Ala Ser Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Val Leu Asn Leu Ser He Ser Leu Thr Ser Leu Ala Thr Gly He Phe 
15 10 15 
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Met Val Ala Ala Gly Asp 
20 

Thr Tyr Met Gly Leu lie 
35 

lie Ser Asp lie Thr Ala 
50 

Ser lie Ser Xaa Tyr Leu 
65 70 
Asn Gin Phe Lys Gly Glu 
85 

lie ser Thr Val Gly Gly 
100 

lie Ala Thr Asn Phe Gly 
115 

He Xaa Phe He Ala He 
130 

Ser Gin His Xaa His Arg 
145 150 
Phe Ala Val Met He Gly 
165 

Glu Gin Gly Trp Phe Ser 
180 

He He Thr Thr Leu He 
195 

Pro Phe Tyr 
210 

(2) INFORMATION FOR SEQ ID NO:330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Val Ala Pro Phe Leu Tyr Leu Xaa He Asn Asp Glu Xaa Xaa Asp Met 

15 10 15 

Lys He Ala Gly Asp Leu Asn Ala Tyr Asn Ala Leu Xaa Ala Tyr Thr 
20 25 30 



258 



He Ala Asp Lys. He Gly Gin Xaa Arg Met.,. 

25 30 
He Xaa Met Phe Xaa Ser Leu Leu Leu He 

40 45 
Leu Leu He He xaa Arg Asn Phe Thr Arg 
55 60 

Gly Tyr Pro Pro Thr Val Gly Val Leu Asn 

75 80 
His Leu Arg Arg Ala He Ser Tyr Leu Met 

90 95 
He Gly Leu Ala Gly Val He Gly Gly Leu 

105 110 
Trp Gin Met Asn Phe He He Ser He Val 

120 125 
Leu Leu Leu Lys Gly Thr Pro Glu Lys Val 
135 140 

His Pro Phe Asp Tyr Lys Gly Met Ser He 
155 160 
Ser Phe Thr Leu Leu Leu Thr Gin Gly Phe 

170 175 
Thr Phe Ser Phe He Cys Leu Ser He Phe 

185 190 
Phe He He He Glu Arg Arg His Glu Val 
200 205 
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Val Xaa Arg Glu Leu Gly Leu Asn Glu Gin. Xaa Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Val Tyr Val lie Ser Leu Asn Asp Asn Ala Ala Asp Gly Arg Asp Thr 

1 5 10 15 

Ser Trp He Tyr Asp Ala Asp Phe Gly Lys He He 
20 25 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

Val Met Ser Lys Leu Arg Pro Leu Leu His Tyr Ala Arg Ala Lys Asp 

1 5 10 15 

He Asp Asn Tyr Arg Thr Val Glu Glu Ser Tyr Arg Gin Gly Gin Tyr 

20 25 30 

Tyr Arg Xaa Xaa He Val Asp Gly Lys Leu Asn He Gin Phe Asn Glu 

35 40 45 

Gly Glu Pro Tyr Phe Gly Arg His 
50 55 

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 
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10 



15 



20 



30 



35 



40 



45 



55 



. . . <A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

Val His Pro Asp Val Leu Leu Thr Arg Tyr Val Glu Gly Lys Glu Asp 

15 10 15 

Gin Val Glu Lys Val Leu Tyr Gin Leu Ala Asp lie Asp lie Ser Glu 

20 25 30 

lie Pro Lys Asp Phe lie Leu Leu Pro Thr His Pro Tyr Gin Xaa Asn 

35 40 45 

Val Leu Xaa Gin Tyr Pro Gin Tyr Met Gin Tyr Ser Glu Gin Gly Leu 

50 55 60 

lie Lys Asp Leu Gly Val Ser Gly Asp Leu Val Ser Pro Thr Ser Xaa 
65 70 75 80 

Xaa Arg Thr Val Phe Ser Lys Ala Leu Asn lie Tyr Leu Xaa Xaa Pro 

85 90 95 

He His Val Xaa He Thr Asn Phe Xaa Arg Thr Asn Asp Leu Glu Gin 

100 105 110 

He Xaa Arg Thr Ser Cys Met Pro Arg Met Ser Ser Ser Ser Asp Gin 
115 120 125 

Arg 



(2) INFORMATION FOR SEQ ID NO: 33 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 113 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Val Gin Asn Lys Glu Leu He Gin His Ala Ala Tyr Ala Ala He Glu 

15 10 is 

Arg He Leu Asn Glu Tyr Phe Arg Glu Glu Asn Leu Tyr Gin Val Pro 
20 25 30 
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.. Pro. Gin Asn His Gin Trp Ser lie 
35 40 
Thr Gly Glu Phe Arg Tyr Trp Ser 

50 55 
Pro Glu Val Trp Leu lie Asp Gly 
65 70 
Lys Glu Ala lie Ala Arg lie Leu 
85 

Asn Gin Thr Ala Val Gin Pro Thr 
100 

Arg 



Gin Leu Ser Glu Leu Glu Thr Leu 
45 

Ala Met Gly His His Met Tyr His 
60 

Lys Ser Lys Lys lie Thr Thr Tyr 

75 80 
Pro His Met Ala Gin Ser Ala Asp 

90 95 
Tyr Gly Xaa Asn Tyr Val Xaa His 
105 110 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 5: 

Val Ala Val Ser Phe Leu Thr Ser Arg Asp lie Pro Xaa Asp Lys Val 

15 10 15 

Val Lys Met Asp Val Asp Pro Pro Xaa Tyr Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 33 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Val Lys Asn Val Lys Thr Gin Ser lie Gin Asn Ala Ser lie val Thr 
15 10 15 



261 



EP 0 841 394 A2 



.. Val Glu Tyr Glu Asn 
20 

Lys Lys Glu He Asp 
35 

Glu Leu Arg Arg Asn 
50 

Phe Gin He Lys Arg 



Asn Thr Asp Met Asp .Lys 
25 

Lys He Lys Phe Lys Asp 
40 

Ser Met Asp Ala Phe Pro 
55 



Ala Glu Glu Gin Leu 
30 

Glu Val Gly Gin Pro 
45 

Val Leu Ala Tyr Ser 
60 



(2) INFORMATION FOR SEQ ID NO: 3 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



35 



40 



50 



Val Met His Xaa He Phe Pro Val Val Ser Xaa Thr Phe Ala Ser Phe 

15 10 15 

Lys Ser Met Tyr Gly Gly He Pro Gin Asp Phe He Asp Tyr Leu Phe 

20 25 30 

He Asp Glu Ala Gly Gin Ala He Pro Gin Ala Ala Val Gly Ala Leu 

35 40 45 

Tyr Arg Ser Lys Lys Val Val Ala Val Gly Asp Pro He Gin He Glu 

50 55 60 

Pro Val Val Thr Leu Glu Ser His Leu He Asp Asn He Arg Lys Asn 
65 70 75 80 

Tyr His Val Pro Glu Tyr Leu Val Ser Lys Glu Ala Ser Val Gin Ser 

85 90 95 

Val Ala Asp Asn Ala Asn Gin Tyr Gly Phe Trp Lys Ser Asp Ala Thr 

100 105 110 

Asp Ser Asn Gin Lys Thr Trp He Gly He Pro Leu Trp Val His Arg 

115 120 125 

Arg Cys Leu Lys Pro Met Phe Thr He Ala Asn Gin He Ala Tyr Asn 

130 135 140 

Asn Lys Met Val Leu Ala Ser Asn He Thr Lys Val Gly Lys Thr Gly 
145 150 155 160 

Trp Tyr Asp Val Lys Gly Asn Ala Val Gin Asn He Xaa Glu Arg Ala 
165 170 175 



262 



EP 0 841 394 A2 



Trp Val Lys Lys 
180 

(2) INFORMATION FOR SEQ ID NO: 33 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 8: 

Val Lys Phe Phe Asp Tyr Thr Phe Ala Ser Val Gly Val Lys Pro Asn 

15 10 15 

Glu Leu Lys Gin Phe Asp Tyr Lys Met Val Glu Val Thr Gin Gly Ala 

20 25 30 

His Ala Asn Tyr Tyr Pro Gly Asn Ser Pro Leu His Leu Arg Val Tyr 

35 40 45 

Tyr Asp Thr Ser Asn Arg Gin He Leu Arg Ala Ala Ala Val Gly Lys 

50 55 60 

Glu Gly Ala Asp Lys Arg He Asp Val Leu Ser Met Ala Met Met Asn 
65 70 75 80 

Gin Leu Thr Val Asp Glu Leu Thr Glu Phe Xaa Val Xaa Tyr Ala Pro 

85 90 95 

Pro Tyr Trp His Pro Lys Asp Leu Xaa Asn Met lie Gly Leu Gin Ser 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Val Met Gin Ser Ser Lys Trp Asn Ala Met Ser Leu Leu Met Asp Glu 
15 10 15 
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Lys Thr Lys Gin Ala Glu Val Leu 
20 

Ala lie Val lie Gly He Gly Ala 

35 40 
Thr Tyr Val Gly Glu Arg Phe Thr 

50 55 
Lys Tyr Arg Phe Phe Asp Met Leu 
65 70 
Ser Trp Gin Glu Tyr Trp Ala Phe 
85 

Tyr 



Arg Thr Ala He Asp Glu Ala Asp., 
25 30 
Gly Met Ser Ala ser Asp Gly Phe 
45 

Glu Asn Phe Pro Asp Phe He Glu 
60 

Gin Ala Ser Leu His Pro Tyr Gly 

75 80 
Glu Ser Arg Phe He Thr xaa Asn 
90 95 



(2) INFORMATION FOR SEQ ID NO: 340: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



Val Tyr Pro He Lys Ala He He 

1 5 
He Glu His Arg Leu Gin Tyr Val 
20 

Tyr Asn Asp Ser Xaa Ala Thr Asn 

35 40 
Asn Ser Phe Asn Gin Pro He He 

50 55 
Glu Gly Asp Glu Phe Asp Glu Leu 
65 70 
His Asp Gly Cys He Arg Thr Asn 
85 



Asp Ser Leu Thr Thr Phe Ser Gly 

10 15 
Gly Thr Asn Arg Thr Asn Lys Tyr 
25 30 
Thr Leu Ala Thr Gin Phe Ala Leu 
45 

Trp Leu Leu Trp Trp Tyr Trp He 
60 

He Pro Tyr Met Glu Asn Val Xaa 

75 80 
Glu Ser Xaa Val Cys 
90 



(2) INFORMATION FOR SEQ ID NO: 341: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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.(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Val Met Gly Thr Asp Lys Arg Val Ser Pro Asp Leu Phe He Lys Arg 

15 10 15 

Leu Asn Phe Tyr Asp Tyr Leu Leu Leu Asn Ser Asp Gly Leu Thr Asp 

20 25 30 

Tyr Val Lys Asp Asn Glu He Lys Arg Leu Leu Val Lys Glu Gly Thr 

35 40 45 

He Glu Asp His Gly Asp Gin Leu Met Gin Leu Ala Leu Asp Asn His 

50 55 60 

Ser Lys Asp Asn Val Thr Phe He Leu Ala Ala He Xaa Gly Asp Lys 
65 70 75 80 

Val 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Val Lys Leu Lys Gly Met Arg Lys Gin Tyr Glu Val Gly Glu Gin Ala 

15 10 15 

Ser Pro He Glu Phe Val Thr Gly Arg Val Asn Pro His Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(iX) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

Val Glu Val Gin Val Met Lys Phe Gly Lys Thr lie Ala Val Val Leu 

15 10 15 

Ala Ser Ser Val Leu Leu Xaa Gly Cys Thr Thr Asp Lys Lys Glu lie 

20 25 30 

Lys Ala Tyr Leu Lys Gin Val Asp Lys He Lys Asp Asp Glu Glu Pro 

35 40 45 

He Xaa Thr Val Xaa Lys Lys He Ala Glu Leu Asp Glu Lys Lys Lys 

50 55 60 

Lys Leu Thr Glu Asp Val Asn Ser Lys Asp Thr Ala Gly Ser Arg Xaa 
65 70 75 80 

Lys Gin 



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Val He Gin Ser Ser Ser Leu Leu Asn Xaa Phe Lys Val Phe Ala Ala 

15 10 15 

He Ser Asp Thr Val He His Gly Ser Asp Val Asp Leu Val Asn Trp 

20 25 30 

Val Glu Asn Ala Leu Gin Ser Gly Leu Ala Ala Lys Glu Arg Lys Asp 

35 40 45 

Ala Leu Ala Val Ser Pro Lys Ser Gly Leu Glu His Met Ser Phe Tyr 

50 55 60 

Asn Gly Ser Val Lys Glu Val Xaa Gly Ala Asp Met Tyr Asp Ala He 
65 70 75 80 

He Asn Ala Leu Ala Asp Leu His Glu Asp Tyr Arg Asp Asn Ala Thr 

85 90 95 

lie Tyr Met Arg Tyr Ala Asp Tyr Val Lys He He Ser Val Leu Ser 
100 105 110 
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Asn Gly .Thr Thr Asn Phe Phe Asp Thr Pro Ala Glu Lys Val Xaa Gly., 

115 120 125 

Xaa Pro Val Gly lie Tyr Arg Cys Arg Xaa Leu Pro Leu Leu Trp Glu 

130 135 140 

Asp Phe Asn Tyr Cys Gly Xaa Tyr Leu Leu Met Gly Gin Pro Tyr Asp 
145 150 155 160 

Thr Asp 



(2) INFORMATION FOR SEQ ID NO: 3 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 

Val Tyr Lys Thr Asp Leu Tyr Xaa Gin Xaa Xaa Lys lie Asp Thr Val 

15 10 15 

Lys Val lie Lys Glu Val Glu Leu Lys Lys Pro He Thr Tyr Glu Ala 

20 25 30 

Gly Ala Thr Ser Asp Ser Lys Leu Xaa Lys Glu Trp Met Asp Ser 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 3 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Val Asn Pro Arg Tyr Lys Xaa Thr Leu Leu His Arg Arg Leu His Thr 

15 10 15 

He Leu Xaa Lys Met Leu Glu Asp Leu Ser Phe Glu Ala Pro Ser Met 
20 25 30 
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Pro Asn Ala Val Val Asp lie Thr Pro Gin Tyr Val Asp Asp Lys Leu.... 

35 40 45 

Lys Ser lie Ser Thr Xaa Lys Asp Leu Ser Ala Phe lie Leu 
50 55 €0 

(2) INFORMATION FOR SEQ ID NO: 3 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

{ C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:347: 

Val Lys He Leu Lys Asp Thr Asn Leu Met Tyr Leu Glu Trp Asn Tyr 

15 10 15 

Leu Val Xaa Arg Val Val Ala Val Val Glu Xaa Gin Gly Thr Leu Ser 

20 25 30 

Asp Ala Val Asn Lys Ala Leu Gin Tyr Trp Val Ser His Val Asp Asp 

35 40 45 

Thr His Tyr Leu Leu Gly Ser Ala Leu Gly Pro Asp Pro Phe Pro Thr 

50 55 60 

He Val Arg Asp Phe Gin Ser Val He Gly Lys Glu Xaa Lys Ser Gin 
65 70 75 80 

He Leu Lys Lys Glu Gly Arg Leu Pro Asp Ala He Val Ala Cys He 

85 90 95 

Gly Gly Gly Ser Asn Ala He Gly Leu Val Pro Phe He His Leu Leu 

100 105 110 

Lys Met Met Leu His 
115 

(2) INFORMATION FOR SEQ ID NO: 348: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID_NO:348: 



Val Gly Arg Leu Pro Lys Lys lie Gin Asn lie Glu Gin Arg Pro Xaa 

15 10 15 

Gly Val ser Xaa Thr Thr Phe Thr lie Ala Val Asn Arg Thr Phe Thr 

20 25 30 

Asn Ala Gin Gly Glu Arg Glu Ala Asp Phe lie Asn Cys Val Thr Phe 

35 40 45 

Arg Lys Gin Ala Glu Asn Val Asn Asn Tyr Leu Ser Lys Gly Ser Leu 

50 55 60 

Ala Gly Val Asp Gly Arg Leu Gin Ser Arg Ser Tyr Asp Asn Lys Glu 
65 70 75 80 

Gly Arg Arg Val Phe Val Thr Glu Val Val Ala Asp Ser Val Gin Phe 

85 90 95 

Leu Glu Pro Lys Asn Asn Asn Gin Xaa Gin Asn Asn Asn Tyr Gin Gin 

100 105 110 

Gin Gly Gin Ala Gin Thr Gly Asn Asn Pro Phe Asp Asn Ser Glu Glu 

115 120 125 

Asp Phe Ser Asp Leu Pro Phe 
130 135 



(2) INFORMATION FOR SEQ ID NO: 349: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 



Val Val Ser Gin Ser lie Leu Leu 

1 5 
Val Thr Xaa Leu Thr Ser Xaa Ala 
20 

Asn Met lie Thr Gly Ala Ala Gin 

35 40 
Ser Ala Ala Asp Gly Pro Met Pro 

50 55 
Ser Arg Asn Val Gly Val Pro Ala 
65 70 



Thr Leu Ser Thr Xaa Leu Thr Asn 

10 15 
Gin Xaa Pro Ala Asp Tyr Val Lys 
25 30 
Met Asp Gly Gly He Leu Val Val 
45 

Gin Thr Arg Glu His He Leu Leu 
60 

Leu Val Val Phe Leu Asn Lys Val 
75 80 
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Asp Met Val Asp Asp Glu Glu Leu 
85 

Arg Asp Leu Leu Ser Glu Tyr Asp 
100 

lie Ala Gly Ser Ala Leu Lys Ala 
115 120 
Glu Lys He Leu Glu Leu Met Glu 

130 135 
Gin Thr 
145 



Leu Glu Leu Val Glu Met Glu Val v . 

90 95 
Phe Pro Gly Asp Asp Val Pro Val 
105 110 
Leu Glu Gly Asp Ala Gin Tyr Glu 
125 

Ala Val Asp Thr Tyr He Pro Leu 
140 



(2) INFORMATION FOR SEQ ID NO: 350: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:350: 

Val Lys Val Ser Gly Cys Ser Asp Leu Val Gin Cys Ala He His Asp 

15 10 15 

Cys Gin Tyr Tyr Glu Val Gin Met Pro Gin Ala His Arg Val Val Pro 

20 25 30 

Thr Thr He Asn Met Gly Asn His Ser Trp Lys Ala Leu Glu He He 

35 40 45 

Tyr Glu Thr Tyr Cys Thr Leu Cys Asp Ser Xaa Leu Asn Lys Xaa Thr 
50 55 60 

Thr 
65 



(2) INFORMATION FOR SEQ ID NO: 351: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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<XX> SEQUENCE DESCRIPTION: SEQ ID _NO : 351 : 



10 



20 



Val Xaa Val Asn He Asn Cys Ala Asn Pro Glu Cys Asn Lys Gin He 

15 10 15 

Leu Val Ser Glu Glu Xaa Glu Thr Lys Tyr Leu Gly Ala Cys Ser Tyr 

20 25 30 

Glu Cys Ala Lys His Glu Arg Asn Arg Tyr Val Gin Ala Asn Asn He 

35 40 45 

Ser Asp Asn Glu Trp Gin Gin Arg Leu Thr Asn Phe Asp Asp Leu His 

50 55 60 

Gin His Ala 
65 

(2) INFORMATION FOR SEQ ID NO: 352: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

2S fC) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Val Thr Arg Ser Lys Met He Val Val Glu Ser Tyr Asp Val Asn Gly 
35 1 5 10 15 

Asn Lys Val Glu Leu Thr Ala His Glu Asp Val Ala Arg Met He Leu 

20 25 30 

His He He Asp Gin Met Asn Gly Leu Pro Phe Xaa Glu Arg Ala Asp 
40 35 40 45 

Arg He Leu Thr Asp Lys Glu Val Glu Ala Tyr Phe He Asn Asp 
50 55 60 

45 (2) INFORMATION FOR SEQ ID NO: 3 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 amino acids 
so (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: , 

Val Thr Lys Asn Ala Gly Asn His Glu lie Glu lie Asp Arg Pro He 

15 10 15 

Val Asn Pro He Lys Lys Leu Pro Thr Arg Phe Thr He Asn Val Ala 

20 25 30 

He Gly Asn Phe Glu Val Asn Leu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 3 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 



Val Lys Gly Gin Asn Leu Thr His Gly Thr Asn Ala He Asp Tyr He 

15 10 15 

Thr Phe Asp Pro Asn Thr Asn Thr Asn Gly He Thr Ala Ala Trp Ala 

20 25 30 

Asn Arg Gin Gin Pro Asn Asn Gin Gin Ala Gly Val Gin His Leu Asn 

35 40 45 

Val Asp Val Thr Tyr Pro Gly He Ser Ala Ala Lys Arg Val Pro Val 

50 55 60 

Thr Val Asn Val Tyr Gin Phe Gly He Pro Ser Asn Tyr Leu Tyr Asn 
65 70 75 80 

Asn Ser Trp Trp His Phe Ser Lys Trp Tyr Ala Ser He Arg He Cys 

85 90 95 

Thr Tyr Ala Lys Arg Phe Arg Phe Thr Asn Arg Trp He Tyr Val 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 355: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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Ui) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:355: 



Val Thr Thr Gin Asn Gin Ala lie Asp Asn Thr Thr Gly Ala Thr Thr 
15 10 15 

1Q Glu Glu Lys Asn Ala Ala Lys Asp Leu Val Leu Xaa Ala Lys Glu Lys 

20 25 30 

Ala Xaa Gin Asp lie Leu Xaa Ala Gin Xaa Thr Asn Asp Val Thr Gin 
35 40 45 

1S lie Asn Asp Gin Ala Val Ala Asp lie Gin Gly lie Thr Gly Asp Thr 

50 55 60 

Thr lie Xaa Asp Val Ala lie Asp Xaa Xaa Ala Thr Pro Ala Thr Xaa 
65 70 75 80 

20 Xaa Xaa Ala Leu Xaa Gly His Xaa Ala Xaa Ala Thr Xaa Glu Xaa Xaa 

85 90 95 

Xaa Gin Xaa Xaa Xaa Thr Lys Xaa Xaa Ala Gin Xaa Xaa Thr Xaa Val 
100 105 110 

2S Phe Thr Xaa Leu Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa 

115 120 125 

Xaa Leu Ala Xaa Xaa Xaa Arg Thr Leu Xaa Ser Arg Xaa Xaa Xaa His 
130 135 140 

30 Xaa Xaa Xaa Xaa Xaa Asn Arg Phe 

145 150 

(2) INFORMATION FOR SEQ ID NO: 3 56: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 



45 



Val Val Ser Leu He Asp Asp Glu 
so 1 5 

Leu Val Val Ser Ala Pro Ser Lys 
20 

Thr Glu Thr Thr Thr Gin Val Xaa 
55 3 5 4 0 



Asp Asp Asn Xaa Asn Asp Arg Gin 

10 15 
Lys Pro Thr Thr Pro Thr Thr Tyr 
25 30 
Met Pro Thr Val Glu Arg Gin Thr 
45 
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Gin Gin (Gin lie Val Tyr Lys Thr Pro Lys. Thr lie Ser Trp lie Lys v , 
50 55 60 

Trp 
65 



(2) INFORMATION FOR SEQ ID NO: 357: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 57: 



Val Lys lie Met Glu Met Ser Val Thr Glu Val lie Phe Ser Phe Leu 

15 10 15 

Gly Gly Leu Gly lie Phe Leu Tyr Gly Leu Lys lie Met Gly Asp Gly 

20 25 30 

Leu Gin Ala Ser Ala Gly Asp Arg Leu Arg Asp lie Leu Asn Lys Phe 

35 40 45 

Thr Ser Asn Pro val Leu Gly Val lie Ala Gly lie Val val Thr lie 

50 55 60 

Leu He Gin Ser Ser Ser Gly Thr Thr Val He Thr He Gly Leu Val 
65 70 75 80 

Thr Ala Gly Phe Met Thr Leu Lys Gin Ala He Gly Val He Met Gly 

85 90 95 

Ala Asn He Gly Thr Thr Val Thr Ala Phe He He Xaa He Arg Phe 

100 105 110 

Arg Arg He Cys Lys Cys Gin Phe 
115 120 



(2) INFORMATION FOR SEQ ID NO: 358: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 358: 

Val Leu Cys Asp Ala Asp Val Leu Ala Leu lie Asp lie Asp Val Asp 

15 10 15 

lie Asp Val Leu Lys Glu Pro Asp Ala Leu Val Leu Val Asp Cys Leu 

20 25 30 

Val Thr Leu Asn His Xaa Met Met 
35 40 

(2) INFORMATION FOR SEQ- ID NO:359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:359: 

Val Leu Val Glu Leu Glu Ser Leu Thr Leu Val Glu Pro Asp lie Glu 

15 10 15 

Ser Leu Lys Leu Val Asp Val Glu Thr Asp Thr Leu Pro Leu lie Glu 

20 25 30 

Ser Asp Val Glu Ser Asp Val Leu Val Glu Phe Asp Pro Leu Met Leu 

35 40 45 

Asp Glu Ser Leu Val Asp lie Glu Ser Leu Ser Asp Ala Leu Met Leu 

50 55 60 

He Glu Ser Asn 
65 

(2) INFORMATION FOR SEQ ID NO: 3 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 
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20 



Val He Ser Lys Xaa Gin Trp Val Val Val.His Phe Tyr Pro Ser Trp.,. 

1 5 10 15 

Lys Asp Ala Glu Gly His Phe Glu Asn Asn Gly Arg Cys Asn Leu Gly 

20 25 30 

Val Xaa Thr Leu Asn Leu Pro Arg Met Ala Leu- Glu Ser Ala Gly Asn 

35 40 45 

Met Thr Lys Phe Trp Glu He Phe Tyr Glu Arg He Asp Val Leu His 

50 55 60 

Asp Ala Leu Leu Tyr Arg He Asn Arg Leu Lys Asp Xaa Val Pro Asn 
65 70 75 80 

Asn Ala Pro He Leu Tyr Lys Ser Gly Ala Phe Asn Tyr Lys Leu Lys 

85 90 95 

Glu Thr Asp Asp Val Ala Glu Leu Phe Lys Asn Lys Arg Ala Thr He 
100 105 no 

Ser Met Gly 
115 



(2) INFORMATION FOR SEQ ID NO: 3 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 



45 



Val Xaa Ser Thr Lys Thr Ala Leu Xaa Gly Asp Ala Lys Leu Asn Glu 

1 5 10 15 

Ala Lys Ala Ala Ala Lys Gin Thr Leu Gly Thr Leu Thr His He Asn 

20 25 30 

Asn Ala Gin Arg Asn Ala Leu Asp Asn Glu lie Thr Xaa Ala Thr Asn 

35 40 45 

Val Glu Val Leu He Gin Leu Lys Pro Lys Arg Asn Asn 
50 55 60 



(2) INFORMATION FOR SEQ ID NO:362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3 62: 

Val lie Glu Met Ala Lys Lys Ala Pro Asp Val Gly Asp Tyr Lys Tyr 

15 10 15 

Gly Phe Pro Ala Asp Asp Val Ser He Phe Arg Ser Glu Arg Gly Leu 

20 25 30 

Thr Glu Asn He Val Arg Glu He Ser Asn Met Lys Asn Glu Pro Glu 

35 40 45 

Trp Met Leu Asp Phe Arg Leu Lys Ser Leu Lys Leu Phe Tyr Lys Met 

50 55 60 

Pro Met Pro Gin Trp Gly Gly Asp Leu Ser Glu Leu Asn Phe Asp Asp 
65 70 75 80 

He Thr Tyr Tyr Val Lys Pro Ser Glu Gin Ala Glu Arg Ser Trp Asp 

85 90 95 

Glu Val Pro Glu Glu He Lys Arg Thr Phe Asp Lys Leu Gly He Pro 

100 105 110 

Glu Ala Glu Gin Lys Tyr Leu Ala Gly Val Ser Ala Gin Tyr Glu Ser 

115 120 125 

Glu Val Val Tyr His Asn Met Glu Lys Glu Leu Glu Glu Lys Gly He 

130 135 140 

lie Phe Lys Asp Thr Asp Ser Ala Leu Gin Glu Asn Glu Glu Leu Phe 
145 150 155 160 

Lys Lys Tyr Phe Ala Ser Val Val Pro Ala Ala Asp Asn Lys Phe Ala 

165 170 175 

Ala Leu Asn Ser Ala Val Trp Ser Gly Xaa Ser Leu He Tyr Val Pro 

180 185 190 

Lys Asn He Lys Leu Asp Thr Pro Leu Gin Ala Tyr Phe Arg He Asn 

195 200 205 

Ser Glu Asn Met Gly Gin Phe Glu Arg Thr Leu He lie Ala Asp Glu 

210 215 220 

Gly Ala Ser Val His Tyr Val Glu Gly Trp Tyr Cys Thr Ser Leu Tyr 
225 230 235 ' 240 

Asn 



(2) INFORMATION FOR SEQ ID NO: 3 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 
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.(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 

Val Lys Gin His Lys Ala Trp Val Asn lie Met Tyr Gly Cys Asp Lys 

15 10 15 

Phe Cys Thr Tyr Cys lie Val Pro Xaa Thr Arg Ser Lys Asp Arg Thr 

20 25 30 

Arg Arg Pro Glu Asp lie He Asp Glu Val Arg Glu Leu Ala Arg Glu 

35 40 45 

Gly Tyr Lys Glu He Thr Leu Leu Gly His Asn Val Asn Ser Tyr Gly 

50 55 60 

Lys Asp Leu Gin Asp He Glu Tyr Asp Leu Gly Asp Leu Leu Gin Ala 
65 70 75 80 

He Ser Lys He Ala He Pro Arg Val Arg Phe Thr Thr Ser His Pro 

85 90 95 

Trp Asp Phe Thr Asp His Met He Asp Val He Ser Glu Gly Gly Asn 

100 105 110 

He Val Pro His He His Leu Pro Val Gin Ser Gly Asn Asn Ala Val 

115 120 125 

Leu Lys He Met Gly Arg Lys Tyr Thr Arg Glu Ser Tyr Leu Asp Leu 

130 135 140 

Val Lys Arg He Lys Asp Arg Leu Pro Asn Val Ala Leu Thr Thr Asp 
145 150 155 160 

He He Val Gly Tyr Pro Asn Glu Ser Glu Glu Gin Phe Glu Glu Thr 

165 170 175 

Leu Thr Leu Tyr Asp Glu Val Gly Phe Glu His Ala Tyr Thr Tyr Leu 

180 185 190 

Tyr Ser Gin Arg Asp Gly Thr Pro Ala Ala Lys Met Lys Asp Asn Val 

195 200 205 

Pro Leu Asn Val Lys Lys Glu Arg Leu Gin Arg Leu Asn Lys Lys Val 

210 215 220 

Gly His Tyr Ser Gin He Ala Met Ser Lys Tyr Glu Gly Gin Thr Val 
225 230 235 240 

Thr val Leu Cys Glu Gly Ser Ser Lys Lys Asp Asp Gin Val Leu Ala 

245 250 255 

Gly Tyr Thr Asp Lys Asn Lys Leu Val Asn Phe Lys Ala Pro Lys Glu 
260 265 270 

Met He Gly 
275 
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(2) INFORMATION FOR SEQ ID NO: 3 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 64: 



Val Thr Gin Tyr Leu Val Thr Thr 

1 5 
His Glu His lie Thr Val Ala Arg 
20 

Glu Ala Glu Ser Lys Glu Glu Ala 

35 40 
Lys lie Arg Arg Asp Gly Asp Ala 
50 55 



Phe Lys Asp Ser Thr Gly Arg Pro 

10 15 
Asp Asn Gin Thr Phe Thr Val lie 
25 30 
Glu Arg Lys Tyr Glu Ala Gin Val 
45 

Lys Glu Asn Xaa Asn Asp 
60 



(2) INFORMATION FOR SEQ ID NO: 365: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:365: 



Val Tyr Lys Lys Ala Gin Ala Phe 

1 5 
Asn Ala lie Gin His Ser Val Lys 
20 

Val Gly He Met Ala Gly Gin Val 
35 40 



Asp Glu He Leu Glu Gly Met Thr 

10 15 
Xaa Gly He Glu Leu Asp Glu Ala 
25 30 
Val Tyr Lys Tyr Glu Glu Glu 
45 



(2) INFORMATION FOR SEQ ID NO: 366: 



<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 105 amino acids 
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.(B) TYPE: amino acid 

(C) STRANDEDNESS : single, 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 



Val lie Asp Met Lys Phe Lys Lys 

1 5 
Gly Val Leu Ala Thr Gly Val Val 
20 

Lys Val Tyr Ser Gin Asn Gly Leu 

35 40 
Leu Glu His Glu Leu Ser Tyr lie 

50 55 
Asp Gin Ala Thr Lys Asp Asn Leu 
65 70 
Leu His Ser lie Lys Asp lie lie 
85 

Asp Val Ser Lys Tyr Glu His Val 
100 



Val Leu Val Ala Thr Ala Met Val 

10 15 
Gly Tyr Gly Asn Gin Ala Asp Ala 
25 30 
Val Leu His Asp Asp Ala Asn Phe 
45 

Asp Val Leu Leu Asp Lys Asn Ala 
60 

Arg Ser Tyr Phe Ala Asp Lys Gly 

75 80 
Asn Lys Ala Lys Gin Asp Gly Phe 
90 95 

Lys 
105 



(2) INFORMATION FOR SEQ ID NO: 367: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



Val Thr Asn Lys Glu Tyr Glu lie 

1 5 
Met Met Lys Arg Leu Asn Lys Leu 
20 

Leu Val lie Ser lie Thr Ala Gly 

35 40 
Val Lys Lys Ser Phe Glu Lys Thr 
50 55 



Glu Pro Gly Lys Arg Glu Cys Glu 

10 15 
Val Leu Gly lie Ser Phe Leu Phe 
25 30 
Cys Gly lie Gly Lys Glu Ala Glu 
45 

Leu Ser Met Tyr Pro lie Lys Asn 
60 
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Leu Glu Asp Leu Tyr Asp Lys Glu 
65 70 
Lys Asn Asp Lys Gly Thr Trp He 
85 

Pro Asn Asn Glu Asp Met Val Ala 
100 



Gly Tyr. Arg Asp Asp Gin Phe Asp..-. 

75 80 
He Asn Ser Glu Met Val He Gin 

90 95 
Lys Gly Met Val Leu Tyr He Glu 
105 110 



(2) INFORMATION FOR SEQ ID NO: 3 68: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 68: 



Val He Asn He Pro Lys Met Leu 

1 5 
Ser Asn Pro Pro He Leu Pro Leu 
20 

Leu Lys Lys Lys Tyr Ser Phe Val 

35 40 
Ala He Lys Thr Gly Ala Thr Arg 

50 55 
Met Arg Tyr He Asn Arg His Val 
65 70 
Val Leu Gly Tyr Gly Asn Glu Lys 
85 

Leu Lys Met Leu Thr He Ser Met 
100 



Lys Tyr Asp Gin He Leu Val Tyr 

10 15 
He Pro Asp Val Leu His Arg Leu 
25 30 
Val Tyr Asp He Ala Pro Asp Asn 
45 

Pro Gly Ser Met He Asp Lys Leu 
60 

Tyr Lys Asn Ala Glu Asn Val He 

75 80 
Leu Leu Thr Lys Ser Ser Asn Phe 
90 95 



(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION 

Val Lys Tyr lie Glu Ala Pro Tyr 

1 5 
Val Lys Asp Lys Glu Leu Ala Asp 
20 

Thr Ala Cys Phe Glu Met Arg Leu 

35 40 
Glu His Gly Tyr Asp Tyr Phe Gly 

50 55 
Lys Asn Ala Gin Leu lie Asn Glu 
65 70 
lie Arg Cys Glu Leu lie Cys Lys 
85 



SEQ ID NO: 369: , 

Glu Pro His Lys Phe Val Lys Met 

10 15 
Glu Lys Glu Gly Gly Leu Arg Xaa 
25 30 
Asp He Val Ala Lys Ala Ala Val 
45 

Ser Ala He Thr Leu Ser Pro Lys 
60 

Leu Gly Met Asp Cys Pro Lys Asn 
75 80 



(2) INFORMATION FOR SEQ ID NO: 370: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:370: 



Val Thr Glu He Lys Tyr Lys Val He Thr Lys Asp Ala Phe Ala Leu 

15 10 15 

Pro Tyr Thr He He Lys Ala Lys Asn Gin Pro Thr Lys Gly Val He 

20 25 30 

Val Tyr He His Gly Gly Gly Leu Met Phe Gly Lys Ala Asn Asp Leu 

35 40 45 

Ser Pro Gin Tyr He Asp He Leu Thr Glu His Tyr Asp Leu He Gin 

50 55 60 

Leu Ser Tyr Arg Leu Leu Pro Glu Val Ser Leu Asp Cys He He Glu 
65 70 75 80 

Asp Val Tyr Ala Ser Phe Asp Ala He Gin Ser Gin Tyr Ser Asn Cys 

85 90 95 

Pro He Phe Thr Phe Gly Arg Ser Ser Gly Ala Tyr Leu Ser Leu He 

100 105 110 

Asn Cys Thr Arg Gin Arg Tyr 
115 



282 



EP 0 841 394 A2 



. (2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO:371: 

Val lie Asp Val Tyr Asn Gin lie Lys Asn Asn Leu Glu Ser Met Thr 

15 10 15 

Pro Glu Thr Ala Thr lie Gly Arg Leu Val Asp Thr Lys Thr Leu Phe 

20 25 30 

Leu Glu Tyr Arg Lys Lys Leu Gin Asp Val Tyr Thr Asp Val Glu Asp 

35 40 45 

Val Lys lie Ala He Ser Asp Arg Phe Lys Leu Leu Gin Ser Gin Tyr 

50 55 60 

Thr Asp Glu Lys Tyr Lys Glu Ala Leu Glu He He Ala Thr Lys Phe 
65 70 75 80 

Gly Leu Thr Val Asn Glu Asp Leu Gin Leu Val Gly Glu Pro Asn Val 

85 90 95 

Val Lys Ser Ala He Glu Ala Ala Arg Glu Ser Thr Lys Glu Gin Leu 

100 105 110 

Arg Asp Tyr Xaa Lys Thr Ser Asp Tyr Xaa Thr Asp Xaa Asp Gly He 

115 120 125 

Val Glu Arg Xaa Asp Thr Ala Glu Ala Glu Arg Thr Thr Xaa Xaa Gly 

130 135 140 

Glu lie Lys Asp Lys Xaa Thr Val Xaa Arg He Ser Lys Arg lie Gly 
145 150 155 160 

Arg Thr Lys Thr He Tyr 
165 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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70 



20 



25 



35 



SO 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

Val Leu Ala Asp He Thr Val Asn Ala Met Lys Gly He Tyr Leu Arg 

15 10 15 

Tyr Asp Glu Asn Gly Ala He Thr Ser His Thr He Asp Lys Asp Gly 

20 25 30 

Val Lys lie Ser Gly Asp Lys Val Asp He Thr Ala Asn Arg Glu Phe 

35 40 45 

Asn Val Phe Ala Asn Asn He Asn Asn Lys Val Gly Lys Asn Asp He 

50 55 60 

Val Asn Ser Leu Asn Leu Ser Asn Glu Gly Leu Asp He Asn Val Asn 
65 70 75 80 

Arg He Gly He Lys Gly Gly Asn Ala Asn Arg Tyr Val Gin Val Gin 

85 90 95 

Asn Asp Phe He Glu Leu Gly Gly He Val Gin Arg Thr Trp Lys Gly 

100 105 110 

Lys Arg Ser Thr Asp Asp He Phe Thr Arg Leu Lys Asp Gly His Leu 

115 120 125 

Arg Phe Arg Asn Asn Thr Ala Gly Gly Ser Leu Tyr Met Ser His Phe 

130 135 140 

Gly He Ser Thr Tyr He Asp Gly Glu Gly Glu Asp Gly Gly Ser Ser 
145 150 155 160 

Gly Thr He Gin Trp Trp Asp Lys Thr Tyr Ser Asp Ser Gly Met Asn 

165 170 175 

Gly He Thr He Asn Ser Tyr Gly Gly Val Val Ala Leu Thr Ser Asp 

180 185 190 

Tyr Asn Arg He He lie Asp Ser Tyr Ala Ser Ala Asn lie Glu Ser 

195 200 205 

Arg Glu Ala Pro He Tyr Leu Ser Pro Asn Thr Gin Lys Leu Asn Leu 

210 215 220 

Xaa Leu Xaa Arg Phe Ala Phe Thr Leu Ser Asn Ala Asp Arg Xaa lie 
225 230 235 240 

Thr Lys Leu Ala Val lie Ser Cys Trp Val Gin Asp Xaa Xaa Tyr Lys 

245 250 255 

Xaa Gly Ala Gly Leu Arg Phe Ser Lys Arg Thr Asn Lys Gly Leu Val 

260 265 270 

Gin Val Val Asn Gly Asp Tyr Ala Thr Gly Gly Asp Thr Thr He Glu 

275 280 285 

Ser Gly Met Ala Lys Phe Asn Leu Val Xaa Arg Lys Arg Trp Lys 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 373: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



Val Asn Pro Thr His Val Lys Tyr 

1 5 
Glu Val Leu Val Cys Thr Val He 
20 

Thr Ala Thr Lys Ala Phe Glu Thr 

35 40 
Asp Glu lie Asp Met Val lie Asn 

50 55 
Phe Asp Asp Val Gin Gin Asp He 
65 70 
Gly His Thr Val Lys Val He He 
85 



Ala Ala Glu Arg Leu Ala Asp Ser 

10 15 
Gly Phe Pro Leu Gly Ala Ser Thr 
25 30 
Glu Asp Ala He Gin Asn Gly Ala 
45 

He Gly Ala Leu Lys Asp Gly Arg 
60 

Glu Ala Val Val Lys Ala Ala Lys 

75 80 
Glu Thr Val Leu Leu Glu Pro 
90 95 



(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Val Lys lie Tyr Glu Asp Thr Asn He Asp Thr Leu Xaa Leu Leu Asn 

15 10 15 

Glu Ala Xaa He Phe Lys Xaa Thr Leu Phe Trp Cys Xaa Xaa Ser Asn 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 37 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 
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.(B) TYPE: amino acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Val Ala His Asp Asn His He Gly Thr Tyr Cys He Val Met Ser Gly 

15 10 15 

Arg Gly Pro Ser Asp Lys Glu Val Asp Kis He Ser Asn Pro Val Arg 

20 25 30 

Thr He Lys Ser Gin His Pro Gin Leu Lys He Cys Ala Cys Leu Gly 

35 40 45 

Leu Thr Asp Cys Arg Pro Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 376: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:376: 



Val He Asp Asp Leu Val Ser Asp 

1 5 
Val Lys Met Ser Asp Leu Thr Ser 
20 

Cys Gly Thr Trp Arg Leu Val Tyr 

35 40 
Leu Thr His Glu His Val His Leu 

50 55 
Val Ser 
65 



His He Ala Thr Gly Gin Leu Leu 

10 15 
Ser Tyr Glu Pro Pro He Glu Ala 
25 30 
Gin Arg Leu Lys Ala Leu Glu Val 
45 

Glu Asn His Val Leu Phe Lys Lys 
60 



(2) INFORMATION FOR SEQ ID NO: 3 77: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 amino acids 
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.(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

Val Leu Met lie Phe Gly Pro Glu Gly Gly Leu Ser Glu lie Glu He 

15 10 15 

Ser Leu Phe Ser Asn Thr Ser Thr Val Val Gly Leu Gly Pro Arg He 

20 25 30 

Leu Arg Ala Glu Thr Ala Pro Leu Tyr Ala Leu Ser Ala He Ser Tyr 

35 40 45 

Glu Lys Glu Leu Met Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

Val Ser Ser He Trp Lys Glu Lys Glu Phe Glu Pro Ser Asp He Val 

15 10 15 

Asp Ala Tyr Leu Val He Ala Ala Thr Asn Glu Pro Arg Val Asn Glu 

20 25 30 

Ala Val Lys Gin Ala Leu Pro Glu His Ala Leu Phe Asn Asn Val Gly 

35 40 45 

Asp Ala Ser Asn Ala Asn Val Val Phe Pro Ser Ala Leu His Arg Asp 

50 55 60 

Lys Leu Thr He Ser Val Ser Thr Asp Gly Ala Ser Pro Lys Leu Thr 
65 70 75 80 

Lys Ser He Met Ala Glu Leu Glu Ala Leu Tyr Pro Pro Ser Tyr Ser 

85 90 95 

Ser Tyr He Asp Phe Leu Tyr Thr Cys Arg Gin Lys He Lys Val Leu 
100 105 110 
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Asp He Thr Tyr He Arg Lys Ala Thr Val Thr Val Thr Asn Cys Val v . 

115 120 125 

Thr Arg He Phe Lys Ser 
130 

(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

Val Val Asn His Leu Cys His Gly Glu Asp Gin Asp Val Xaa Ala His 

15 10 15 

Ala He Thr Asp Leu Glu Asn Lys Gly He Ala Leu His Thr Asn Val 

20 25 30 

Glu Thr Thr Glu Leu Ser Ser Asp Asp His His Thr Thr Val Arg Xaa 

35 40 45 

Asn Val Gly Asn Xaa Glu Ala Asp Ala Val Leu Leu Ala He Gly Arg 

50 55 60 

Lys Pro Asn Thr Ala Leu Ala Leu Glu Asn Thr Asp He Glu Leu Gly 
65 70 75 80 

Asp Arg Gly Glu He Lys Val Asn Ala Gin Leu Gin Thr Ser Val Pro 

85 90 95 

His He Tyr Ala Ala Gly Asp val Lys Gly Gly Leu Gin Phe Thx Tyr 

100 105 110 

He Ser Leu Asp Asp Tyr Arg He He Lys Ser Ala Leu Tyr Gly Asn 

115 120 125 

Gin Ser Arg Thr Thr Asp Asn Arg Gly Ser Val Pro Tyr Thr Val Phe 

130 135 140 

He Asp Pro Pro Leu Ser Arg Val Gly Leu Thr Ser Lys Glu Ala Ala 
145 150 155 160 

Ala Gin His Tyr Asp Tyr Thr Glu His Gin Leu Leu Val Ser Ala He 

165 170 175 

Pro Arg His Lys He Asn Asn Asp Pro Arg Gly Leu Phe Lys Val Val 

180 185 190 

He Asn Asn Glu Asn Asn Met He Leu Gly Ala Thr Leu Tyr Gly Lys 
195 200 205 
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Gin Ser _Glu Glu Leu He Asn lie lie Lys .Leu Ala He Asp Gin Asa,, 

210 215 220 

He Pro Tyr Thr Val Leu Arg Asp Ser He Tyr Ser His Pro Thr Met 
225 230 235 240 

Gly Arg He He 



(2) INFORMATION FOR SEQ ID NO: 3 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 



Val Val Glu Trp Ala Leu Tyr He Ala Lys Asn Lys He Ala He Asp 

15 10 15 

Val Pro Gly Ser Gly Met Gly Ala Gin Cys Trp Glu Phe Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 381: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:381: 



Val Leu Glu Leu Arg Glu Glu His 

1 5 
Glu Ser His Lys Glu Leu Lys Asp 
20 

Asn Leu Glu Gin Thr Lys He Leu 

35 40 
Thr Gin Val Xaa Val Xaa Gin Lys 
50 55 



Lys Gin His His Asn Glu Leu Arg 

10 15 
Lys Gin Asp Lys Val Val Asp Glu 
25 30 
Asn Arg He Glu Glu Arg Tyr Xaa 
45 

Asn Glu Glu Lys Thr Leu Ala Gin 
60 
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Asn Lys Trp Leu Val Gly Ala lie Trp Ala Leu Val Thr He Val Met v . 
65 70 75 80 

He Ala Val He Thr Ala Ser He Xaa Ala Leu Leu Pro 



(2) INFORMATION FOR SEQ ID NO: 3 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:382: 

Val Asn Ser Thr Leu He Arg He Gly Asp He lie Gin Ser Ser Thr 

15 10 15 

Asn Thr Asn Pro lie He Met Gly lie lie Leu Gly Gly lie lie Thr 

20 25 30 

Val Val Gly Thr Ala Pro Leu Ser Ser Met Ala Leu Thr Ala Leu Leu 

35 40 45 

Gly Leu Thr Gly Val Pro Met Ala He Gly Ala Met Ala Ala Phe Ser 

50 55 60 

Ser Ala Phe Met Asn Gly Thr Leu Phe His Arg Leu Lys Leu Gly Asp 
65 70 75 80 

Arg Lys Ser Thr lie Ala Val Ser lie Glu Pro Leu Ser Gin Ala Asp 

85 90 95 

lie Val Ser Ala Asn Pro lie Pro lie Tyr He Thr Asn Phe Phe Gly 

100 105 110 

Gly Ala lie Ala Gly Leu lie lie Ala Met Ser Gly Leu He Asn Asp 

115 120 125 

Ala Thr Gly Thr Ala Thr Pro lie Xaa Gly Phe Leu Val Met Xaa Gly 

130 135 140 

Phe Asn His Pro Met Thr lie Val He Tyr Gly Val Val Met Ala lie 
145 150 155 160 

Val Gly cys Ala Cys Arg Val lie Leu Gly Ser lie Arg lie 



85 



90 



165 



170 



(2) INFORMATION FOR SEQ ID 



NO: 383: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

Val Gly Lys Thr Gly Phe Ala Leu Asn Met Met Leu Asn He Ala Arg 

15 10 15 

Asn Gly Tyr Lys Thr Ser Phe Phe Ser Leu Glu Thr Thr Gly Thr Ser 

20 25 30 

Val Leu Lys Arg Met Leu Ser Thr He Thr Gly He Glu Leu Thr Lys 

35 40 45 

He Lys Glu He Arg Asn Leu Thr Pro Asp Asp Leu Thr Lys Leu Thr 

50 55 60 

Asn Ala Met Gly Ser Lys Ser Leu Lys Leu Gly He 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Val Pro Asn Glu Thr Glu Ala Glu Leu Leu Ser Gly He Lys Val Thr 

15 10 15 

Asn Glu Gin Ser Met Lys Asp Asn Ala Asn Tyr Phe Leu Ser Leu Gly 

20 25 30 

He Lys Thr Val Leu He Thr Leu Gly Lys Gin Gly Thr Tyr Phe Ala 

35 40 45 

Thr Lys Asn Gin Ser Gin His He Glu Ala Tyr Lys Val Asn Ala He 

50 55 60 

Asp Thr Thr Ala Ala Gly Asp Thr Phe He Gly Ala Phe Val Ser Arg 
65 70 75 80 

Leu Asn Lys Ser Gin Asp Asn Leu Ala Asp Ala He Asp Phe Gly Asn 
85 90 95 
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Lys Ala Ser ser Leu Thr val Gin Lys Thr Arg Arg Ala ser lie Tyr,_, t 

100 105 110 

Ser Ser Thr Arg Arg Ser Lys Ser Ser Leu Asn Glu Ser Asn Thr Ala 
115 120 125 

Met lie 
130 

(2) INFORMATION FOR SEQ ID NO: 3 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 85: 

Val Tyr Lys Met Arg Lys lie Gly Tyr Ala Arg Val Ala Tyr Pro Asp 

15 10 15 

Gin Asn Leu Asp Thr Gin Leu Thr Lys Leu Leu lie Asn Gly Cys Asp 

20 25 30 

Leu Val Tyr Ser Glu Gin Val Asn Val Tyr Tyr Lys Glu Gin Leu Glu 

35 40 45 

Leu Glu His Cys Leu Asp Glu Leu Lys Thr Asp Asp Thr Leu Val lie 

50 55 60 

Glu Lys Leu Lys Val Leu Gly Phe Thr Pro Lys Lys Leu Met Glu Phe 
65 70 75 80 

Phe Glu Ser Arg He Leu Pro Tyr Asp He His Leu Glu Val Leu Asp 

85 90 95 

Leu Gly He Asn Thr Asn Ser Glu Glu Gly Gin Ser Phe He Glu Val 

100 105 110 

Phe Lys Met Leu Ala Asp Ser Glu Asn He Leu Leu Lys Glu Arg Thr 

115 120 125 

Thr Asn Gly Leu Glu Ser Ala Lys Glu Arg Gly Arg Tyr Gly Gly Arg 

130 135 140 

Pro Gin Leu Ser Glu Asp Lys Arg Lys Tyr He Lys Gin Leu Phe Ala 
145 150 155 160 

Ser Arg Met Tyr Thr Pro Asn Glu He Ser Lys Trp 
165 170 

(2) INFORMATION FOR SEQ ID NO: 38 6: 
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10 



20 



30 



45 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 86: 

Val Leu lie Met He Asn Glu Met Leu Asn Pro Lys Gin Gin Glu Val 

15 10 15 

Trp Thr Cys Phe He Asn Asp Lys Pro Lys Val Leu He Ala Ser Gly 

20 25 30 

Ala Lys Arg Ala Gly Lys Thr Tyr Val Phe He Leu Leu Phe Leu Met 

35 40 45 

His He Ala Thr Tyr Lys Asp Lys Gly Leu Asn Phe He He Gly Gly 

50 55 60 

Ala Thr Gin Ala Ser He Arg Arg Asn He Leu Asp Asp Met Glu Leu 
65 70 75 80 

He Leu Gly Arg Glu Leu Thr Leu Asp Lys Ser Asn Ala Val Lys He 

85 90 95 

Phe Gly Asn Lys Val Tyr Val Phe Asp Gly Gin Asn Ser Asp Ala Trp 

100 105 110 

Lys Lys Ala Arg Gly Phe Thr Ser Ala Gly Ala Phe Leu Asn Glu Gly 

115 120 125 

Thr Ala Leu His Asn Met Phe He Lys Glu Val Phe Ser Arg Cys Ser 

130 135 140 

Tyr Lys Gly Ala Arg He Leu He Asp Thr Asn Pro Glu Asn Pro Met 
145 150 155 160 

His Pro Val Lys Lys Asp Tyr He Asp Lys Ser Gly Gin Arg Leu Ser 

165 170 175 

Asn Gly Arg Leu Asn He Lys Ala Phe Gin Phe Thr Leu Phe Asp Asn 

180 185 190 

Thr Phe Leu Asp Glu Glu Tyr He Glu Ser He lie Ala Ser Thr Pro 

195 200 205 

Thr Gly Met Phe Thr Asp Arg Asp He Tyr Gly Lys Trp Val Ser Ala 

210 215 220 

Glu Gly Val Val Tyr Lys Asp Phe Lys Glu Lys Val His Tyr He Thr 
225 230 235 240 

Glu Glu Glu Phe Lys Thr Lys Gin He Lys Arg Lys Tyr Ala Gly Val 

245 250 255 

Asp Trp Gly Tyr Glu His Tyr Gly Ser He Met Val Val Ala Glu Asp 
260 265 270 
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Phe Asp Gly Asn Lys Tyr Val lie Glu Glu .His Ala His Arg His Lys,. 

275 280 285 

Glu lie Asp Asp Trp Val Ala lie Ala Lys Glu Leu 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

{C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Val Asn Leu lie Arg Ser His Ala Cys Gly Leu Gly Glu Pro Phe Pro 

15 10 15 

Lys Glu Val Ala Leu Val Met Met lie Xaa Arg Leu Asn Thr Leu Leu 

20 25 30 

Lys Gly His Ser Gly Ala Thr Leu Val He Ser Glu Thr He Thr He 

35 40 45 

Phe Tyr Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 3 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 88: 

Val He His Val Leu Asp Phe Asn Asp Lys He He Asp Phe Leu Ser 

15 10 15 

Thr Asp Asp Pro Ser Leu Val Arg Ala He His Lys Arg Asn Val Asn 

20 25 30 

Asp Asn Ser Glu Met Leu Glu Leu Leu He Ser Ser Glu Arg Ala Glu 
35 40 45 
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xaa Phe Arg Glu Arg His Arg Val lie lie Arg Asp Ser Asn Lys Gin,.,. 

50 55 60 

Trp Arg Glu Phe lie lie Asn Trp Val Gin Asp Thr Met Asp Gly Tyr 
65 70 75 80 

Thr Glu lie Glu Cys He Ala Ser Tyr Leu Ala Asp He Thr Thr Ala 

85 90 95 

Lys Pro Tyr Ala Pro Gly Lys Phe Glu Lys Lys Thr Thr Ser Glu Ala 

100 105 110 

Leu Lys Asp Val Leu Ser Asp Thr Gly Trp Glu Val Ser Glu Xaa Thr 

115 120 125 

Glu Tyr Asp Gly Leu Arg Thr Thr Ser Trp Thr Ser Tyr Xaa Thr Arg 

130 135 140 

Tyr Glu Val Leu Lys Gin Leu Cys Thr Thr Tyr Lys Met Ala Leu Asp 
145 150 155 160 

Phe Tyr He Glu Leu Ser Ser Asn Thr Val Lys Gly Arg Tyr Val Val 

165 170 175 

Leu Asn Lys Lys Asn Ser Leu Phe Gin Gly Lys Glu He Glu Tyr Gly 

180 185 190 

Lys Asp Leu Ala Trp Val Asn 
195 

(2) INFORMATION FOR SEQ ID NO: 3 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389: 

Val Leu Val Gin Gly Gin Val Cys Ser His Leu Ser Thr Xaa Leu Gly 

15 10 15 

Leu He He He Asp Glu Glu His Glu Ser Thr Tyr Lys Gin Glu Asp 

20 25 30 

Tyr Pro Arg Tyr His Ala Arg Glu lie Ala Gin Trp Arg Ser Glu Tyr 

35 40 45 

His His Cys Pro Val He Leu Gly Ser Ala Thr Pro Cys Leu Glu Ser 

50 55 60 

Tyr Ala Arg Ala Glu Lys Asp Val Tyr His Leu Leu Ser Leu Pro Asn 
65 70 75 80 
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Arg Val Asn Gin Gin Ala Leu Pro Glu lie Asp lie Val Asp Met Arg,,. 

85 90 95 

Glu Glu Leu Ser Glu Gly Asn Arg Ser Met Phe Ser Lys Asp Leu Arg 

100 105 110 

Glu Ala lie Gin Leu Arg Xaa Arg Ser Thr Gly Thr Ser Cys Phe lie 

115 120 125 

Phe Lys Ser Asp Val Val Met His Arg Leu Cys Tyr Val Gly lie Val 

130 135 140 

Asp Met Tyr Arg Asn Val Gin Xaa Val He Phe His 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 3 90: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH : 88 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:390: 



Val Lys Tyr Leu Asn Asp Asp lie 

1 5 
Leu Glu Ser Asn Met Lys Leu Arg 
20 

Asn Val Asp Lys Leu Thr Ser Ser 

35 40 
Thr Gly Glu Pro Cys Leu Arg Tyr 

50 55 
Gly Lys Pro Phe Asp Ser Ser Asp 
65 70 
Gin Phe Tyr He Pro Ser Lys Lys 
85 



Ala Lys Gly Ser He Phe Asp Tyr 

10 15 
He Gly Phe Ser Asp lie Phe Phe 
25 30 
Glu Ala Ser Leu Leu Gin Leu Ser 
45 

His Gin Thr Phe Tyr Thr Met Thr 
60 

He Val Phe His Tyr Xaa His Ala 
75 80 



(2) INFORMATION FOR SEQ ID NO:391: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:391: 



Val Arg His Leu His lie Pro Leu 

1 5 
Lys Arg Met Arg Arg Lys Tyr Thr 
20 

Thr Lys Leu His Lys Ala Leu Pro 

35 40 
lie val Gly Phe Pro Gly Glu Thr 

50 55 
Asp Phe He Val Lys His Lys Phe 
65 70 
Ser Pro Arg He Gly Thr Pro Ala 
85 

Glu Glu He Lys Asn Glu Arg Val 
100 



Gin Ser Gly Ser Asp Thr Val Leu 

10 15 
Met Asp Arg Phe Ser Glu Arg Leu 
25 30 
Asp Leu Ala Val Thr Ser Asp Val 
45 

Glu Ala Glu Phe Gin Glu Thr Tyr 
60 

Ser Glu Leu His Val Phe Pro Tyr 

75 80 
Ala Arg Met Asp Asp Gin He Asp 

90 95 
His Lys Val Asn Tyr Ala 
105 110 



(2) INFORMATION FOR SEQ ID NO: 392: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: ainino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 



val Phe Val Val Glu Ala Leu val Lys Thr Arg Glu Gly His Gly Asn 

15 10 15 

Leu Glu Leu Leu Asp Lys Glu Val Ala Thr Pro Leu Asp Asp Lys Val 

20 25 30 

Lys He Lys Val His Tyr Ala Gly He Cys Gly Thr Asp Leu His Thr 

35 40 45 

Tyr Glu Gly His Tyr Xaa Val Asn Phe Pro Val Thr Leu Gly His Glu 

50 55 60 

Phe Ser Gly Xaa He Val Glu Val Gly Ser Arg Arg 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 393: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 

Val Thr Ser His Val Asn Ser He Lys He His He Gly Arg Xaa Lys 

15 10 15 

Gin Gly Lys Thr He Leu Lys Asn He Ser Cys His He Pro Lys Gly 

20 25 30 

Asp Lys Trp He Leu Tyr Gly Leu Xaa Gly Ala Gly Lys Thr Thr Leu 

35 40 45 

Leu Asn He Leu Asn Ala Tyr Glu Pro Ala Thr Thr Gly Gly Val Asn 

50 55 60 

Leu Phe Gly Lys Met Pro Gly Lys Val Gly Tyr Ser Ala Glu Thr Xaa 
65 70 75 80 

Arg Gin His He Gly Phe Val Ser His Ser Leu Leu Glu Lys Xaa Pro 
85 90 95 

Arg Gly 



(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94: 



Val Val He Ser Gly Ala Phe Lys 

1 5 
Asp Asp Glu Val Arg Asn Glu Ala 
20 

He Ser Ala Lys Ala Gin Gin Tyr 
35 40 



Ser Thr Gly Val Tyr Gin Asp He 

10 15 
His His Leu Leu Lys Leu Val Gly 
25 30 
He Gly Tyr Leu Ser Thr Gly Glu 
45 
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70 



15 



45 



50 



55 



Lys Gin Arg Val Met lie Ala' Arg Ala Leu Met Gly Gin Pro Gin Val, 

50 55 60 

Phe Asn Phe Arg 
65 

(2) INFORMATION FOR SEQ ID NO: 3 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

Val Thr Xaa Gly Val Xaa Pro Lys Pro Leu His Ala Asn Leu Met He 
25 1 5 10 15 

Arg Thr Leu Ser Gly His He Gin His He Ala Phe Gly Pro He Ala 

20 25 30 

Ser Leu Glu Ser lie Lys His Leu Gly Thr Asn Gly Gly Gly Phe Leu 
30 3 5 4 0 4 5 

Ala Gly Asn Ser Ala Thr Pro Phe Glu Asn Pro Asn He Trp Ser Asn 

50 55 60 

Phe He Glu Met Gly Ser Met Met Leu Leu Pro Met Ser Met Leu Phe 
35 65 70 75 80 

Leu Phe Gly Arg Met Leu Ser Arg His Gly Lys Arg Val His Arg His 

85 90 95 

Ala Leu He Leu Phe Val Ala Met Phe Phe He Phe He Ala He Leu 
«> 100 105 110 

Thr Leu Thr Met Trp Ser Glu Tyr Arg Gly Asn Pro He Leu Ala Asn 

115 120 125 

Leu Gly He Tyr Gly Pro Asn Met Glu Gly Lys Glu Val Arg Phe Gly 

130 135 140 

Ala Gly Leu Ser Ala Leu Phe Thr Val He Thr Thr Ala Phe Thr Thr 
145 150 155 160 

Gly Ser Val Asn Asn Met His Asp Ser Leu Thr Pro He Gly Gly Leu 

165 170 175 

Gly Pro Met Val Leu Met Met Leu Asn Val Val Phe Gly Gly Glu Gly 

180 185 190 

Val Gly Leu Met Asn Leu Leu He Xaa Xaa Leu Leu Thr Val Phe He 
195 200 205 
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Cys Ser Leu Met Val Gly Lys Thr 

210 215 
Gly Ala Arg Glu Met Lys Cys lie 
225 230 
lie Leu He Leu Val Phe Ser Ala 
245 

Ser Glu Ser He Thr Asn Pro Ser 
260 

Leu 



Pro Glu Tyr Leu Asn Met Pro Ile Vi 
220 

Val Leu Val Phe Leu He His Pro 
235 240 
Leu Ala Phe Met He Pro Gly Ala 

250 255 
Phe His Gly lie Ser Gin Val Met 
265 270 



(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



Val Cys Glu Phe lie Lys Pro Pro 

1 5 
Gly Val Val Gly Gly Tyr Val Xaa 
20 

Gin Ser Gin Ala Arg Gin Thr Ala 

35 40 
Lys Glu Ala Asp Xaa He Lys Lys 

50 55 
Glu Xaa Gin Leu Xaa Arg Glu Xaa 
65 70 
Arg Xaa Asp Leu Xaa Arg Gin Gly 
85 

Lys Leu Arg Ala His Ser 
100 



Thr His Val Xaa Gly He He Leu 

10 15 
Ala Arg Asn Xaa Xaa Leu Gin Lys 
25 30 
Xaa Asp He Val Xaa Gin Ala His 
45 

Glu Lys Leu Leu Xaa Ala Lys Glu 
60 

Thr Glu Ala Glu Xaa Arg Glu Arg 

75 80 
Asn Pro Thr Ser Ser Lys Arg Arg 
90 95 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 



val Arg Leu His Leu Cys Leu His Gin Arg Lys Ala Phe Leu Thr Asp 

15 10 15 

Trp Ser Tyr He Ala Gly Asn He Ala He Val Ala He He Pro Leu 

20 25 30 

Leu He Tyr Phe Tyr Val Pro Phe Phe Lys Lys Leu Lys Val Thr Ser 

35 40 45 

Ala Tyr Glu Tyr Leu Glu Ala Arg Phe Gly Pro Ser He Arg Val Xaa 

50 55 60 

Gly Ser Leu Leu Phe Val Val Tyr His Leu Gly Arg Val Ala He val 
65 70 75 80 

He Tyr Leu Pro Thr Leu Ala He Thr Ser Val Ser Asp Met Asn Pro 

85 90 95 

Tyr He Xaa Gly Ser Leu Val Gly Leu Leu Cys He Xaa Xaa Thr Ser 

100 105 110 

Xaa Gly Xaa Phe Xaa Gly Val Arg Leu Glu 
115 120 



(2) INFORMATION FOR SEQ ID NO: 398: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 



Val Tyr Lys He Thr He Asn Glu 

1 5 
Phe Asn Pro He Thr Ser Leu Thr 
20 

Lys Ala Thr Phe Met Phe Asp Lys 

35 40 
Gin Glu Asp Asn Lys Gly Arg Lys 
50 55 



Val Phe Asn Met Thr Glu Thr Thr 

10 15 
He Asn Asn Glu Glu Val Lys Ala 
25 30 
Thr Ala Lys Lys Phe Ala Thr Glu 
45 

Gin Thr lie Ser Gly Phe Thr Asn 
60 
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Val Tyr Xaa Ala Leu Leu Glu Arg Asp Thr Val Ala He Val Asp Phe., 
65 70 75 80 

Trp Glu Cys Ala Thr Ala Tyr Leu Gly Lys Ser Ala Pro Lys Arg Glu 

85 90 95 

Asp He Glu Ala Glu He Met Glu He He Glu Arg Glu Asn Asp Thr 

100 105 110 

Leu He Phe Tyr Lys Val Arg Trp Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO: 39 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399: 

Val Pro Ser Asn Lys Val Ala Phe He Pro Ser Glu Ala Gin Phe He 

15 10 15 

Gin Leu Cys Gin Asp Asp Asn Asp Val Lys Gin Ala Ser His Asn Leu 

20 25 30 

Tyr Asp Gly Val Thr Phe Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

Val Asn Gly Val Gin Asp Asn Ser He Gly Ser Thr Asn Glu Ser Gin 

15 10 15 

Phe Leu Gly Asn Tyr He Val He Lys His Ala Glu Asn Glu Tyr Ser 
20 25 30 
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Leu He Ala His Leu Gin Gin Tyr 

35 40 
Asn Val Lys Tyr Gly Asp Phe Leu 

50 55 
Ser Thr Glu Pro His He His Phe 
65 70 
Glu Ala Cys Thr Ser Leu Lys He 
85 

He Lys Gly Asp Val Val Cys Gly 
100 



Ser lie He Val Asn Glu Gly Gin.,, 
45 

Gly Lys Val Gly Asn Ser Gly Asn 
60 

Gin Val Met Asn Asp Lys Asn He 

75 80 
Arg Phe Leu Asn Asn Leu Glu Leu 

90 95 
Leu Gin Gly Glu 
105 



(2) INFORMATION FOR SEQ ID NO: 401: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 



Val His Lys Glu Asn He Met Leu 

1 5 
Ser Val Leu Xaa Gin Met Ser Asp 
20 

Lys Ser Thr Phe Lys Asp Ala Val 

35 40 
Thr Gly Leu Pro Thr His Leu Cys 

50 55 
Val Glu His He Asn His Arg Thr 
65 70 
Glu Val Ala Val He Gly Met Gly 
85 



Asn Xaa Ser Ala Thr Asp Lys Glu 

10 15 
Val Leu Phe Gin Asn Gly Phe Val 
25 30 
He Asp Arg Glu Lys Glu Xaa Xaa 
45 

Ser Val Ala He Pro His Thr Asp 
60 

He Gly Val Gly Val Leu Glu Lys 

75 80 
Thr He Gly Ser Thr Gly Arg Arg 
90 95 



(2) INFORMATION FOR SEQ ID NO: 402: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 



Val Arg Ala Ala Phe Gly Lys Asn 

1 5 
Ser Val Ala Tyr Met Phe Asp His 
20 

Lys Ser Val Asp Glu lie Leu Glu 

35 40 
Met Xaa Met Met 
50 



Gly Gly Asn Met Gly Val Ser Gly 

10 15 

Val Ala Thr Phe Gly He Glu Gly 
25 30 

Thr Leu Met Glu Pro Arg Cys Lys 
45 



(2) INFORMATION FOR SEQ ID NO: 403: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 

val Met Phe Asn Thr Thr Val Asn Ser Asp Thr Asp Val He Lys Tyr 

15 10 15 

Gly Arg Leu Leu Val Asp Lys Gly Ala Gin Ser Val He Val Ser Leu 

20 25 30 

Gly Gly Asp Gly Ala He Tyr He Asp Lys Glu He Ser He Lys Ala 

35 40 45 

Val Asn Pro Gin Gly Lys Val Val Asn Thr Val Gly Ser Gly Asp Ser 

50 55 60 

Thr Val Ala Gly Met Val Ala Gly He Ala Ser Arg Phe Asn Asp 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 404: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



304 



EP 0 841 394 A2 



(iX) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Val Pro lie Ser Asn Asp Ala lie Val Glu Leu Val Ser Arg He Lys 

15 10 15 

Pro Val Ser Glu Met Met Glu Arg Glu Thr Asp Leu Gly Val Ala Thr 

20 25 30 

Glu Phe Glu He He Thr Ala Met Met Phe Leu Tyr Phe Gly Glu He 

35 40 45 

His Pro Val Asp Phe Val He Val Glu Ala Gly Leu Gly lie Lys Asn 

50 55 60 

Asp Ser Thr Asn Val Phe Thr Pro Val Leu Ser He Leu Thr Ser He 
65 70 75 80 

Gly Leu Asp His Thr Asp He Leu Gly Gly Thr Tyr Leu Asp He Ala 

85 90 95 

Arg Asp Lys Gly Ala He He Lys Pro Asn Val Pro Val He Tyr Ala 

100 105 110 

Val Lys Asn Glu Asp Ala Leu Lys Tyr Val Arg 
115 120 



(2) INFORMATION FOR SEQ ID NO: 405: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 



Val Phe He Gly Thr Tyr Met He 

1 5 
Ala Val He Thr Asp Thr Asp Glu 
20 

Val Xaa Glu Asn Lys Val He Ser 

35 40 
Gin Ala Asp Thr He Cys Val His 

50 55 
Phe Val Ser Gin He Arg Glu He 
65 70 
Gin Ser Leu 



Leu Ser He Arg Lys Glu Ser Asp 

10 15 
Ala Leu Lys Gin Val Leu Lys Met 
25 30 
Gin Asn Asn Lys Glu Val Thr Leu 
45 

Gly Asp Gly Glu His Ala Leu Leu 
60 

Leu Met Xaa Glu Gly He Asp He 
75 80 
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(2) INFORMATION FOR SEQ ID NO: 406: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
to (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

75 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Val Phe Ala Asp Arg Arg Tyr Glu Asp Asp Gly Gin Leu Val Ser Arg 
15 10 15 

20 Lys Glu Ser Asp Ala Val lie Thr Asp Thr Asp Glu Ala Leu Lys Gin 

20 25 30 

Val Leu Lys Met Val Xaa Glu Asn Lys Val lie Ser Lys Asn Asn Lys 
35 40 45 

25 Glu Val Thr Leu Gin Ala Asp Thr lie Cys Val His Gly Asp Gly Glu 
50 55 60 

His Ala Leu Leu Phe Val Ser Gin lie Arg Glu lie Leu Met Lys Glu 
65 70 75 80 

30 Gly He Asp He Gin Ser Leu 

85 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 4 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : S ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

Val Tyr Asp He Glu Val Ser Asp Tyr Lys Gly Leu Thr Tyr Lys Leu 

15 10 15 

Glu Ala Phe Arg Gly Lys Val He Leu Val Val Asn Thr Ala Thr Glu 

20 25 30 



55 
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Cys lie Tyr Ser Glu Gin Leu Lys Lys Leu .Glu Thr Leu Xaa Gin Lys 

35 40 45 

Tyr Lys Asp Arg Gly Phe Val Val Leu Ser Ser Pro Asn Asn Asn Cys 

50 55 60 

Asp Asn Arg Gin Pro Xaa Ser Asn Glu Glu lie Leu Lys lie Xaa Arg 
65 70 75 80 

Xaa Glu lie Trp val Leu His Phe Gin Cys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 

Val Glu lie Glu Asn Glu Pro Tyr Phe Val Gly Lys Asp lie Ala Glu 

1 5 10 15 

He Leu Gly Tyr Ala Arg Ala Asp Asn Ala He Arg Asn His Val Asp 

20 25 30 

Ser Glu Asp Lys Leu Thr His Gin Phe Ser Asp Ser Arg Ser Lys Gin 

35 40 45 

Lys Cys Asn Asp Gin Ser Thr Asn Gin Asp Tyr Thr Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO:409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409: 

Val Arg Xaa Leu He Xaa Glu Ser Xaa He Asp Leu Ala Ser Pro Xaa 
1 5 10 15 
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Asp Asn Arg Pro Phe He Asp Val Phe Leu Glu Pro Thr Lys Leu Tyr 

20 25 30 

Val Xaa Pro Val Leu Ala Leu Lys Lys Glu Val Ser He Lys Ala Met 

35 40 45 

Asn His He Thr Gly Gly Gly Phe Tyr Glu Asn He Pro Arg Ala Leu 

50 55 60 

Pro Ala Gly Tyr Ala Ala Arg He Asp Thr Thr Ser Phe Pro Thr Pro 
€5 70 75 80 

Lys He Phe Asp Trp Leu Gin Gin Gin Gly Asn He Asp Thr Asn Glu 

85 90 95 

Met Tyr Asn lie Phe Asn Met Gly He Gly Tyr Thr Val He Val Asp 

100 105 no 

Glu Lys Asp Ala Ser Thr Arg Phe Glu Asp Phe Ser Arg Thr Lys Cys 

115 120 125 

Gly Ser Leu Ser Asn Trp Ser Tyr Cys Glu Lys 
130 135 

(2) INFORMATION FOR SEQ ID NO: 410; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Val Pro He Phe Pro Asn Lys Ala Leu He Glu Ser Ala Val Ala Arg 

15 10 15 

Gly Glu Leu Asp Glu Ser Val Phe Asn Gin Leu Val Thr Asp Met Leu 

20 25 30 

Leu Glu His His Tyr Asn He Pro Gin His Tyr He Asn Leu Tyr He 

35 40 45 

Asp Asn He Lys Thr Leu Lys Asp Val Pro Ala Ser Tyr Met Asn His 

50 55 60 

Ser Asn Val Asp Val Val Ala Asp Leu Leu Leu Glu Lys Ser Lys Arg 
65 70 75 80 

Xaa Tyr Gly 



{2> INFORMATION FOR SEQ ID NO: 411: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



Val Leu Phe lie Gly Phe Phe Val 

1 5 

Leu Leu Thr Leu Leu Leu Asp Pro 
20 

Pro Thr Lys His Asn Gin Tyr Thr 
35 40 



Val He He Asn Thr He Ala Asp 

10 15 
Lys Gin Arg Leu Gin Leu Gly Asn 
25 30 
He Asp He Arg Lys 
45 



(2) INFORMATION FOR SEQ ID NO: 412: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:412: 

Val Leu He Glu Val His Asp Pro His Glu Leu Glu Arg Ala Xaa Xaa 

15 10 15 

Val Asn Ala Lys Leu He Gly Val Asn Asn Arg Asp Leu Lys Arg Phe 

20 25 30 

Val Thr Asn Val Glu His Xaa Asn Thr He Leu Glu Asn Lys Lys Pro 

35 40 45 

Asn His His Tyr He Ser Glu Ser Gly lie His Asp Ala Ser Asp Val 

50 55 60 

Arg Lys He Leu His Ser Gly He Asp Gly Leu Leu He Gly Glu Ala 
65 70 75 80 

Leu Met Arg Cys Asp Asn Leu Ser Glu Phe Leu Arg Gin Leu Lys Xaa 

85 90 95 

Xaa Lys Val Lys Ser 
100 



309 



EP 0 841 394 A2 



.. . . (2) INFORMATION FOR SEQ ID NO:4J3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B ) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 

Val Ser Asp Glu Pro Xaa lie Tyr Lys Phe lie Gin His Asn Xaa Glu 

15 10 15 

Asn Phe lie Xaa Thr Ala Ser Xaa lie Met Asp Gly His Thr Xaa Val 

20 25 30 

Ala Pro Leu Lys Xaa Thr His Lys Leu Pro Cys Ala Phe Cys Ser Tyr 

35 40 45 

Gin Ser Val Cys His Val Asp Xaa Met lie Asp Ser Lys Arg Tyr Xaa 

50 55 60 

Xaa Val Asp Glu Thr lie Asn Pro lie Glu Ala lie His Asn lie Asn 
65 70 75 80 

lie Asn Asp Val Phe Gly Gly Glu 
85 

(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Val Asp His Leu Xaa Val Thr Leu Arg He Asp Arg Glu Asn His Met 

15 10 15 

Tyr Gly Arg Leu Ala Ser Glu Ser Val Val Glu Asn Met Phe Thr Pro 

20 25 30 

Val His Asp Asp Asn Leu Lys Asn Glu Val He Glu Ala Lys Pro Trp 
35 40 45 
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Xaa Glu Arg He Thr Asn Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415: 

Val Xaa Tyr Met Asp Lys Gly Leu Thr Gly His He Met Arg Arg Gly 

15 10 15 

He Thr Glu Ala Asp Ala Ser He Asn Trp Ala Leu Gly Leu Met Asn 

20 25 30 

Glu Gly Ser Gin He He Asp Asn Thr Thr Asn Leu Cys Gly Asp Arg 

35 40 45 

Thr Thr Xaa Xaa Leu Lys Ser Gly Gly Xaa Gly Thr Gly Glu Gin Lys 

50 55 60 

lie Asn Leu Tlir Ser Lys Xaa Ala Gin He Trp 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Val Tyr Lys Gin Gly Glu Pro Asn Leu Trp Thr Gly Arg Leu Asp Ser 

15 10 15 

Glu Thr Asp Pro Lys Lys Phe Arg His Phe Gin Thr Val Thr Phe Glu 

20 25 30 

Asp Leu Ser Lys Leu Glu Lys Ser Ser Met Pro Ser Gly Val Gly He 
35 40 45 
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Leu Gly Tyr Ala Val Gly Gin Axg Cys Cys Phe Xaa Gin Gly Ala His Vl 

50 55 60 

Trp Cys Lys Axg Arg Thr Arg Cys Asp 
65 70 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 



Val His His lie Thr His Tyr lie Asp Gin Leu Asp Arg Phe Ser Ser 

15 10 15 

Pro Gly Asp Val lie Lys Xaa His Ser Gly Gly Tyr His His Lys Tyr 

20 25 30 

Gly Thr Phe Asn Lys Leu Gly Tyr lie Asn Glu Asn Tyr lie Glu Leu 

35 40 45 

Leu Asp Val Glu Asn Asn Glu Lys Leu Lys Lys Met Ala Xaa Thr lie 

50 55 60 

Glu Gly Gly Val Ala Cys Ala Thr Gin He Ala Gin Glu Lys Tyr Glu 
65 70 75 80 

Gin Gly Phe Lys Asn Met Cys Val Xaa Thr Asn Asp He Glu Ala Val 

85 90 95 

Lys Asn Asn Leu Gin Xaa Glu Gin Val Xaa Val Val Ala Pro Thr Gin 

100 105 110 

Met Glu Arg Asp Thr His Lys Asp Gly Lys Val Lys Trp Gin Leu Leu 
115 120 125 

He Leu 
130 



(2) INFORMATION FOR SEQ ID NO: 418: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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.. . (ii) MOLECULE TYPE: Protein . v 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418: 

Val Arg Gly Ser Lys lie Gin Thr lie Gly Val lie Leu Pro Ser Leu 

15 10 15 

Thr Asn Pro Phe Phe Ser Ala Leu Met Gin Ser lie His Asp His Lys 

20 25 30 

Pro Ser Asp Val Asp Leu Cys Phe Leu Thr Ser Thr Ala Thr Asp Xaa 

35 40 45 

Tyr Asp Asn lie Lys His Leu lie Asp Arg Gly lie Asp Gly Leu He 

50 55 60 

He Ala Gin Tyr He Ser Ser Pro Asp Ala Leu Asn Asn Tyr Leu Lys 
65 70 75 80 

Lys His His Val Pro Tyr Val Val Leu Asp Gin Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Val Leu Xaa Asp Glu Xaa Xaa Val Asp Lys Asp Ala Leu Arg Asn Asn 

15 10 15 

Ala Leu Val Lys Gly Gin Phe Lys Ala Asp His Gin Tyr Gin He Val 

20 25 30 

He Gly Pro Gly Xaa Val Asp Glu Val Tyr Lys Gin Phe He Asp Glu 

35 40 45 

Thr Gly Ala Gin Glu Ala Ser Lys Asp Glu Ala Lys Gin Ala Ala Ala 

50 55 60 

Lys Lys Gly Asn Pro Val Gin Arg Leu He Lys Leu Leu Gly Glu He 
65 70 75 80 

Phe He Pro He Leu Pro Ala He Val Thr Thr Gly Leu Leu Met Gly 
85 90 95 

He Gin 
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. (2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 : 

Val lie Gin Leu Leu Met His Met He His Tyr Gly Thr Ser Val His 

15 10 15 

He He Arg Thr Gin Ser He Leu Asn Asp Asp Lys Val Asn Gin Val 

20 25 30 

Cys Asp Tyr He Glu Leu His Phe His Glu Asp Leu Ser Leu Ser Glu 

35 40 45 

Leu Ser Glu Tyr Val Gly Trp Ser Glu Ser His Leu Ser Lys Lys Phe 

50 55 60 

Thr Glu Ser Leu Gly Val Gly Phe Gin His Phe Leu Asn Thr Thr Arg 
65 70 75 80 

He Glu His Ala Lys Leu Asp Leu Thr Tyr Thr Asp Glu Thr He Thr 

85 90 95 

Asp He Ala Leu Gin Asn Gly Phe Ser Ser Ala Ala Ser Phe Ala Arg 

100 105 HO 

Thr Phe Lys His Phe Thr His Gin Thr Pro Lys Gin Tyr Arg Gly Asp 

115 120 125 

Arg Pro Ala He Thr Glu Asn Gin Gin Ser Ala Gin His Asn Tyr His 

130 135 140 

Asp Arg Glu Leu He Leu Leu Leu Asn Asp Tyr He Glu Glu Met Asn 
145 150 155 160 

Gin Phe Asn 



(2) INFORMATION FOR SEQ ID NO:421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 



Val Asn Leu Gly Gly Ser Val Thr Ser lie Gin Pro Leu Arg lie Asn 

15 10 15 

Leu Thr Ser Asn Glu Asn Phe Thr Asp Lys Asp Trp Gin lie Thr Gly 

20 25 30 

lie Pro Arg Thr Leu His lie Glu Asn Ser Thr Asn Arg Thr Asn Asn 

35 40 45 

Ala Arg Glu Arg Asn lie Glu Leu Val Gly Asn Leu Leu Pro Gly Asp 

50 55 60 

Tyr Phe Gly Thr lie Arg Phe Gly Arg Lys Glu Gin Leu Phe Glu lie 
65 70 75 80 

Arg Val Xaa Pro His Xaa Pro Gin Leu Gin Arg Gin Leu Ser Asn Xaa 

85 90 95 

Glu Val Arg Asn Tyr Lys Ser Ala Cys 
100 105 



(2) INFORMATION FOR SEQ ID NO: 422: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 



Val lie Lys Arg Leu lie Asn Glu Thr Phe Asp Ala Asn Tyr lie Glu 

15 10 15 

Val He Glu Gly Gly He Glu Glu Thr Gin Thr Leu He His Leu Pro 

20 25 30 

Phe Asp Tyr Val Ser Tyr Arg Ser Glu He Val Gly Lys Ser Phe He 

35 40 45 

Lys Leu Gin Arg Asn Leu Val Leu 
50 55 

(2) INFORMATION FOR SEQ ID NO: 42 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
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•(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

Val lie Thr Phe Val Leu Val Pro Val Leu lie Pro Thr Leu Lys Arg 

15 10 15 

Met Lys Phe Gly Gin Ser lie Arg Glu Glu Gly Pro Gin Ser His Met 

20 25 30 

Lys Lys Thr Gly Thr Pro Thr Met Gly Gly Leu Thr He Ser He Lys 

35 40 45 

Val Phe Val He Thr Val Phe Gly Gly Tyr Tyr He Cys Lys He Lys 

50 55 60 

Leu He Gin Ser Tyr Leu Leu Leu Phe Val Thr Asp Trp Phe Trp Val 
65 70 75 80 

Asn Trp Phe Tyr Arg 
85 

(2) INFORMATION FOR SEQ ID NO: 42 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

val Met Xaa Asn Gly Xaa Leu He Glu His Gly Thr Arg Glu Ser Val 

15 10 15 

Leu His His Pro Glu His Val Tyr Thr Lys Tyr Leu Leu Ser Xaa Xaa 

20 25 30 

Lys Lys Xaa Asn Asp His Phe Lys His Val Met Arg Gly Asp Val His 
35 40 45 

Xaa 



(2) INFORMATION FOR SEQ ID NO: 425: 
(i) SEQUENCE CHARACTERISTICS: 
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.(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 



Val Pro Thr He Asp Ser Val Lys 

1 5 
Gly Met Met Thr Gly Leu He He 
20 

He Lys Phe Gin Leu Leu Val Val 

35 40 
Met Ser Ala Leu He Ala Thr Tyr 

50 55 
Ala Arg His Gin Leu Val Ala Arg 
65 70 



Thr Tyr Gly Leu Val Ser He Pro 

10 15 
Gly Gly Val Pro Pro Leu Gin Ala 
25 30 
Phe He His Thr Thr Ala Xaa He 
45 

Leu Ser Tyr Gly Gin Phe Phe Asn 
60 

Asn Thr Asp Val Lys Ser Glu Ser 
75 80 



(2) INFORMATION FOR SEQ ID NO: 426: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 



Val Met Pro Xaa Val Phe Trp Xaa Pro Phe Pro Xaa He Phe He Gly 

15 10 15 

Thr Ala Leu Pro Leu Ala Gly Thr Val Ala Thr Gly Ala He His Phe 

20 25 30 

Thr Ala Asn Glu Val He Pro He Gly Xaa Met Leu Xaa Asn Asn Gly 

35 40 45 

Leu He Ala He Asn Leu Ala Tyr Gin Asn Leu Asp Arg Ala Phe Val 

50 55 60 

Gin Asp Gly Thr Asn He Glu Ser Lys Leu Ser Leu Ala Ala Thr Pro 
65 70 75 80 

Lys Leu Ala Ser Lys Gly Ala He Arg Glu Ser He Arg Leu Ala lie 
85 90 95 
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Gly Ala Asn Asn 
100 



20 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 427: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



Val Lys Val Gly Lys Ser Met Glu Gly Leu Asn His Arg Arg Asn Thr 

1 5 10 15 

Glu Lys Glu Glu Thr Thr Gin Thr Gin Ser Val Ala Pro Asn Thr Gly 
20 25 30 

25 Glu Glu Gly Met Ser Ser Gly Lys 

35 40 

(2) INFORMATION FOR SEQ ID NO: 428: 

30 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 

Val Gin Phe Leu Asn Gin Trp Leu Gly Asp Thr Leu Ala Phe Lys Val 

15 10 15 

Lys His Met Leu Arg Gin Arg Val He Tyr Lys Asn Asn Gly His Pro 

20 25 30 

He Gly Glu Gin Met Thr He Leu Thr Glu Asn He Asp Gly Leu Ala 

35 40 45 

Pro Phe Tyr Lys Ser Tyr Leu Pro Gin Val Val Gin He Asp 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 42 9: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 116 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429: 



Val Val He Xaa He Ala Leu He Ser Xaa Val Phe Leu Pro Gly He 

15 10 15 

Phe Tyr Val He Ser Cys Val Val Gly Tyr Leu Cys Phe Gly Leu Gly 

20 25 30 

Leu Gly He Tyr Ala Thr Pro Ser Thr Asp Thr Ala He Ser Asn Ala 

35 40 45 

Pro Leu Asp Lys Val Gly Val Ala Ser Gly He Tyr Lys Met Ala Ser 

50 55 60 

Ser Leu Gly Gly Ala Phe Gly Val Ala He Ser Gly Ala Val Tyr Val 
65 70 75 80 

Gly Ala Val Ala Ala Thr Ser He His Thr Gly Ala Met He Ala Leu 

85 90 95 

Trp Val Asn Val Leu Met Gly He Met Ala Phe He Ala He Leu Phe 

100 105 110 

Gly Ala Xaa Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 430: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 0: 



Val Glu Leu Gly Tyr Leu Phe Arg Asn Tyr Arg Leu Phe Asn Met Asp 

15 10 15 

Gly Leu Ala Leu Lys Leu Asn Leu Ser Ser Cys Leu Phe Ser Arg Lys 
20 25 , 30 
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.Ser Met lie Tyr Phe Asn lie Gly Gin lie. lie Ala Asn He He Cys, 

35 40 45 

Trp Ala Leu He Ala Pro Thr Leu Asp He Leu He Tyr Asn Glu Pro 

50 55 60 

Ala Asn Lys Val Tyr Thr Gin Gly Val He Ser Ala Val Leu Asn He 
65 70 75 80 

He Ser Val Gly He He Gly Thr He Leu Leu Lys Ala Tyr Ala Ser 

85 90 95 

Ser Gin He Lys Lys Gly Ser Leu Arg Lys Glu 
100 105 

(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

Val Tyr Ala Gly Glu Asn Phe Met He Leu Leu Ala Ser Arg Leu Asp 

15 10 15 

Ala Val Val Tyr Ser Leu Gly Leu Ala Arg Thr Arg Arg Gin Ala Arg 

20 25 30 

Gin Leu Val Asn His Gly His He Leu Val Asp Gly Lys Arg Val Asp 

35 40 45 

He Pro Ser Tyr Ser Val Lys Pro Gly Gin Thr He Ser Val Arg Glu 

50 55 60 

Lys Ser Gin Lys Leu Asn He He Val Glu Ser Val Glu lie Asn Asn 
65 70 75 80 

Phe Val Pro Glu Tyr Leu Asn Phe Asp Ala Asp Ser Leu Thr Gly Thr 

85 90 95 

Phe Val Arg Leu Pro Glu Arg Ser Glu Leu Pro Ala Glu He Asn Glu 

100 105 HO 

Gin Leu He Val Glu Tyr Ser Cys Gin Asp Asn Thr Phe He Thr He 

115 120 125 

Thr His Asn Cys Gly Cys Phe Phe Tyr 
130 135 

(2) INFORMATION FOR SEQ ID NO: 432: 
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(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 



Val His Leu Val Arg Met Val Tyr Lys Gly Lys lie Ser Asp His Tyr 

15 10 15 

Leu Ser Val Trp Glu Lys Ala Thr Thr Tyr Gin Met Tyr His Gly Leu 

20 25 30 

Ala Leu Leu lie He Gly Val He Ser Gly Thr Thr Ser He Asn Val 

35 40 45 

Asn Trp Ala Gly Trp Leu He Phe Ala Gly He He Phe Phe Ser Gly 

50 55 60 

Ser Leu Tyr He Leu Val Leu Thr Gin He Lys Val Leu Gly Ala He 
65 70 75 80 

Thr Pro He Gly Gly Val Leu Phe He He Gly Trp lie Met Leu He 

85 90 95 

He Ala Thr Phe Lys Phe Ala Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 



Val Leu Lys Leu Phe Gin Met Arg Ser Pro He Phe Arg Glu Pro Ser 

15 10 15 

Ala Asn Asn Ala Val Lys Thr Leu lie Ala Met Gly Ser lie Leu Ala 

20 25 30 

Phe Leu Leu Val Gly lie Gly Gly Leu Ala Tyr Val Tyr Gly lie Met 
35 40 45 
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. Pro Gin Thr Glu Thr Thr Val Leu Ser Gin. Leu Ala Met Gin lie Phe--- 
50 55 60 

Gly 
65 



(2) INFORMATION FOR SEQ ID NO: 434: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



Val Met Arg Gly Thr lie lie lie 

1 5 
Ala Leu Asn Ser Pro Arg Pro Asp 
20 

Thr Lys Val Asn Ala Lys Lys Pro 

35 40 
Lys lie Ser Asn lie Gly Leu He 

50 55 
Tyr Ser Leu Lys 
65 



Pro Thr Thr Lys Pro Gly Leu He 

10 15 
Met Lys Asp Leu Asn Thr Gly Val 
25 30 
Asn Thr He Val Gly He Pro Ala 
45 

Met Arg Arg Ala Arg Ala Leu Ala 
60 



(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 5: 



Val Ser Asp Thr Ala val Met He Ala Trp Leu Val Lys He Pro His 

15 10 15 

Ser Thr Met Pro He Leu Gly Thr Ser Gin Leu Lys Arg Xaa Asp Gin 
20 25 30 
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Ala He Glu Gly Leu Gin Leu Asn Leu Asp. Asp Gin Val Val Gly v 
35 40 45 

{2) INFORMATION FOR SEQ ID NO:436: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 6: 

Val Asp He Pro Leu Leu Phe Glu Asn Glu Leu Glu Asn Thr Val Asp 

15 10 15 

Glu Val Trp Val Val Tyr Thr Ser Glu Ser He Gin Met Asp Arg Leu 

20 25 30 

Met Xaa Arg Xaa Asp Leu Ser Leu Glu Asp Ala Lys Ala Arg Val Tyr 

35 40 45 

Xaa Pro Asn Phe Tyr 
50 

(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

Val Met Gin Val Phe Thr Trp Gin Gin Tyr Val Glu He Val Val Asn 

15 10 15 

Glu Gly Arg Asp Ala Ala Asn Ala Ala Gin Glu Lys Ala Val Lys Glu 

20 25 30 

Gly Lys He He He Lys Asp Ser He Ala Asp He Phe Leu Gin Gin 

35 40 45 

He Leu Thr Arg Pro Ala Glu His Asp Val Val Ala Thr Met Asn Leu 
50 55 60 
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Asn Gly Asp Tyr lie Ser Asp Ala Leu Ala. Ala Gin Val Gly Xaa Ile Vt 
65 70 75 80 

Gly lie Ala Pro Gly Ala Asn lie Asn Tyr Glu Thr Gly His Ala lie 

85 90 95 

Phe Glu Ala Thr His Gly Leu Xaa Ser Lys He Cys Arg Phe Lys 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

Val Ser Met Phe He Thr Gly Met Gin Tyr Gly Asp Lys Val Ala Val 

15 10 15 

His Val Ser Arg Gly Ala Val Phe Gly Met Thr Gly Val Leu Val Val 
20 25 30 

Phe 



(2) INFORMATION FOR SEQ ID NO:439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 



Val Val Thr Asp Gin Leu Leu Ala 

1 5 
Ser Gin Phe Asn Pro Arg Gly Gly 
20 

Tyr Ala Asn Gly Gly Leu He Thr 
35 40 



Phe Phe Asn Asn Arg Tyr Trp Arg 

10 15 
Trp Ser Pro Ser Gly Pro Arg Arg 
25 30 
Lys His Gin Leu Xaa Glu Val Gly 
45 
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. .Glu Gly Asp Lys Gin Glu Met Val lie Pro- Leu Thr Arg Arg Lys Arg«-'. 
50 55 60 

Ala He Gin Leu Thr Glu Gin val Met Arg He He Gly Met Asp Gly 
65 70 75 80 

Xaa Pro Asn Asn lie Thr Val Asn Asn Asp Thr Ser Thr Val Glu Lys 

85 90 95 

Leu Phe Glu Thr Asn Cys Tyr Val Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 440: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 



Val Ala Gly Ala Leu Asp Val Asn Gin Val Asn Val Ser Glu Asn Asp 

15 10 15 

Asn Ala Asn Gin Pro His Ser Val Leu Leu He Asp Thr Gin Ala Val 

20 25 30 

Asp Glu Asn Asn Ser Glu Leu Asn Gin Val Gly Thr Ser Thr Lys Ala 

35 40 45 

Gin He Ala fhe Cys He Asp Val Arg Ser Glu Pro Phe Arg Arg His 

50 55 60 

He Glu Ala Ala Gly Pro Phe Glu Thr He Gly lie Ala Gly Phe Phe 
65 70 75 80 

Gly Leu Pro He Gin Lys Asp Ala Val Asp Glu Gin Phe Lys His Asp 

85 90 95 

Ser Leu Pro Val Met Ser Arg Ala Ala Gly He Ser His 
100 105 



(2) INFORMATION FOR SEQ ID NO: 441: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(i.i) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO; 441: 

Val Thr Phe Gly Val Thr Gly Thr Asn Gly Lys Thr Ser lie Ala Thr 

1 5 10 15 

Asp Asp Ser Phe Asn Ser Xaa Lys Val Thr Lys Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 110 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 



35 



45 



Val Asp Gly Thr Ala Asn Gly Val Gly Ser Thr Leu Xaa Leu Asn Glu 

15 10 15 

Ser Leu Asp Gin Phe lie Leu Leu lie Phe Tyr Gly Thr Phe Pro Gly 

20 25 30 

Gly Asp Phe Thr Glu Phe Gly Ser Pro Phe Gly Gly Gly Lys lie Ser 

35 40 45 

Leu Asn Pro Ser Asn Leu Pro Asp Gly Asp Gly Asn Gly Gly Gly Val 

50 55 60 

Tyr Glu Phe Gly Leu Thr Lys Ser Ser Arg Thr Ser Leu Thr lie Ser 
65 70 75 80 

Asn Asp Val Tyr Phe Asp Leu Gly Ser Gin Arg Gly Ser Gly Ala Asn 

85 90 95 

Ala Asn Arg Gly Thr lie Asn Lys lie lie Gly Val Arg Lys 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 4 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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15 



20 



25 



35 



40 



45 



.. . (ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:443: 

Val Leu Asp Asn Asp Leu Asn Gin Val Thr Leu Ala Asp Tyr Ala Gly 

15 10 15 

Lys Lys Lys Leu He Ser Val Val Pro Ser He Asp Thr Gly Val Cys 

20 25 30 

Asp Gin Gin Thr Arg Lys Phe Asn Ser Glu Ala Ser Lys Glu Glu Gly 

35 40 45 

He Val Leu Thr He Ser Ala Asp Leu Pro Phe Ala Gin Lys Arg Trp 

50 55 60 

Cys Ala Ser Ala Gly Leu Asp Asn Val He Thr Leu Ser Asp His Arg 
65 70 75 80 

Asp Leu Ser Phe Gly Glu Asn Tyr Gly Val Val Met Glu Arg Thr Ser 

85 90 95 

Cys Arg He Arg His Glu Leu Val Gin Tyr Leu Tyr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Val Ser Gin Glu Arg Tyr Ser Arg Gin He Leu Phe Lys Gin He Gly 

15 10 15 

Glu He Gly Gin Ser Lys He Asn Gin Lys Cys Ala Leu He He Gly 

20 25 30 

Met Gly Ala Leu Gly Thr His Val Ala Glu Gly Leu Val Arg Ala Gly 

35 40 45 

He Ala Lys Leu He He Val Asp Arg Asp Tyr He Glu Phe Ser Asn 

50 55 60 

Leu Gin Arg Gin Thr Leu Phe Thr Glu Glu Asp Ala Leu Lys Met Met 
65 70 75 80 

Pro Lys Val Val Ala Ala Lys Lys His Leu Leu Ala Leu Arg Ser Asp 
85 90 95 



327 



EP 0 841 394 A2 



.Val Asp lie Asp Gly Cys lie Ala His Val. Asp Tyr Tyr Phe Leu Gly.... 

100 105 110 

Asn Thr Trp Thr Gly Arg Trp Thr Leu Leu Leu Met Gin Pro lie Thr 

115 120 125 

Leu Lys His Asp Asn 
130 

(2) INFORMATION FOR SEQ ID NO: 445: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



30 



40 



45 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

Val Thr Leu Met Lys Leu Val Phe Val Ala Arg Ala Gly Asn Met Ala 

15 10 15 

Gin Ala lie Phe Thr Gly lie lie Asn Ser Ser Asn Leu Asp Ala Asn 

20 25 30 

Asp He Tyr Leu Thr Asn Lys Ser Asn Glu Gin Ala Leu Lys Ala Phe 

35 40 45 

Ala Glu Lys Leu Gly Val Asn Tyr Ser Tyr Asp Asp Ala Thr Leu Leu 

50 55 60 

Lys Asp Ala Asp Tyr Val Phe Leu Gly Thr Lys Pro His Asp Phe Asp 
65 70 75 80 

Ala Leu Ala Thr Arg He Lys Pro His He Thr Lys Asp Xaa Cys Phe 

85 90 95 

Met Ser He Met Ala Gly He Pro Thr Asp Tyr He Xaa Gin Gin Leu 

100 105 110 

Glu Cys Gin Asn Pro Xaa Ala Arg He Met Pro Xaa Thr Xaa Ala Xaa 

115 120 125 

Val Gly His Ser Val Thr Gly He Ser Phe Ser Asn Asn Phe Glu Pro 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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ID) TOPOLOGY: linear . _. , 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:446: 

Val His Arg Ser Leu Met Lys lie Leu lie Asn Val Leu Glu Gin Phe 

1 5 10 15 

Asp Asn Ala Val Val Leu Phe Met Gly Ala Gly Asp lie Gin Lys Leu 

20 25 30 

Gin Asn Ala Tyr Leu Asp Lys Leu Gly Met Lys Asn Ala Phe 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447: 

Val Phe Glu Asn Glu Pro Leu Lys Pro Asn His Glu Leu Tyr Glu Leu 

15 10 15 

Glu Asn Val Thr lie Thr Ala His lie Thr Gly Asn Asp Tyr Glu Ala 

20 25 30 

Lys Tyr Asp Leu Leu Asp lie Phe Lys Asn Asn Leu Val Asn Phe Leu 

35 40 45 

Asn Lys Asn Gly Leu lie Glu Asn Glu Val Asp Ala Lys Lys Gly Tyr 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 86 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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10 



15 



25 



35 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:448: 

Val Lys Glu He Thr Lys Leu Asn Gly Leu Cys Leu Lys Trp Val Ala 

15 10 15 

Pro Gly Thr Arg Gly Val Pro Asp Arg He He lie Met Pro Glu Gly 

20 25 30 

Lys Thr Tyr Phe Val Glu Met Lys Gin Glu Lys Gly Lys Leu His Pro 

35 40 45 

Leu Gin Lys Tyr Val His Arg Gin Phe Glu Asn Arg Asp His Lys Val 

50 55 60 

Tyr Val Leu Trp Asn Lys Glu Gin Val Asn Thr Phe He Arg Met Val 
65 70 75 80 

Gly Gly Thr Phe Gly Asp 
85 

(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Val Trp Gly He Val He Leu Gly Gly Tyr Glu Gin Phe He Lys Ser 

15 10 15 

His Leu Arg Lys He Tyr He Asp Gly Val Ser Asn Met Gin Glu His 

20 25 30 

Val Val Val Thr Leu Asp Gly Lys Asp Tyr Leu Val Glu Pro Gly Thr 

35 40 45 

Asn Leu Leu Glu Phe He Lys Ser Gin Asp Thr Phe Val Pro Ser He 

50 55 60 

Cys Tyr Asn Glu Ser Met Gly Pro He Gin Thr Cys Asp Thr Cys Thr 
65 70 75 80 

Val Glu He Asp Gly Lys He Glu Arg Ser Cys Ser Thr Val He Asp 

85 90 95 

Arg Pro Met Thr Val Asn Thr Val Asn Asn Asp Val Lys Asp Ala Gin 
100 105 110 

Lys Glu Pro 
115 
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- (2) INFORMATION FOR SEQ ID NO: 45.0: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 



Val Leu Thr Met lie Ser Tyr Leu 

1 5 
Asn Gly Leu lie Asn Met Asn Lys 
20 

Asp Ala lie Xaa Leu Asn Lys Asp 
35 40 

Cys Phe 
50 



Leu Phe Leu Leu Ser Gly Leu Ala 

10 15 
Glu Gly lie Asp Lys Trp Gin Ala 
25 30 
Ala Asn Gin Thr Val Gin Xaa Ser 
45 



(2) INFORMATION FOR SEQ ID NO: 451: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



Val Lys Lys Val Val Lys Tyr Leu 

1 5 
He Val Leu Phe Val Gin Thr Phe 
20 

Asn Asn Asp Met Ser Pro Thr Leu 

35 40 
Asn Lys He Lys Val Thr Phe Asn 

50 55 
Thr Tyr Arg Arg Gly Asn Glu He 
65 70 



He Ser Leu He Leu Ala He He 

10 15 
Val He Val Gly His Val He Pro 
25 30 
Asn Lys Gly Asp Arg Val He Val 
45 

Gin Leu Asn Asn Gly Asp He He 
60 

Tyr Thr Ser Arg He He Ala Lys 
75 80 
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Pro Gly Gin Ser Met 
85 

Arg Pro Val Asp Ala 
100 

Ser Leu Arg Asn Phe 
115 

Asn Phe Val Val Leu 
130 

Gin Phe Gly Leu He 
145 

Arg Tyr Tyr Pro Phe 
165 



Ala Phe Arg Gin Gly. Gin 
90 

Ser Tyr Ala Lys Asn Arg 
105 

Lys Glu Leu Asp Gly Asp 
120 

Asn Asp His Asp Asn Asn 
135 

Asp Lys Lys Asp He He 
150 155 
Ser Lys Trp Thr He Gin 
170 



Leu Tyr Arg Asp Asp v> 
95 

Lys He Lys Asp Phe 
110 

He He Pro Pro Asn 
125 

Gin His Asp Ser Arg 
140 

Gly Asn He Ser Leu 
160 

Phe Lys Ser 



(2) INFORMATION FOR SEQ ID NO:452: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 



Val Ser Lys Leu Lys Lys Glu Leu Leu Glu Trp He He Ser He Ala 

15 10 15 

Val Ala Phe Val He Leu Phe lie Val Gly Lys Phe He Val Thr Pro 

20 25 30 

Tyr Thr He Lys Gly Glu Ser Met Asp Pro Thr Leu Lys Asp Gly Glu 

35 40 45 

Arg Val Ala Val Asn He He Gly Tyr Lys Thr Gly Gly Leu Glu Lys 

50 55 60 

Gly Asn Val Val Val Phe His Ala Asn Lys Asn Asp Asp Tyr Val Lys 
65 70 75 80 

Arg Val He Gly Val Pro Gly Asp Lys Val Glu Tyr Lys Asn Asp Thr 

85 90 95 

Leu Tyr Val Asn Gly Lys Lys Gin Asp Glu Pro Tyr Leu Asn Tyr Asn 

100 105 110 

Leu Lys His Lys Gin Gly Asp Tyr He Thr Gly Thr Phe Gin Val Lys 

115 120 125 

Asp Leu Pro Glu Cys Glu Ser Gin He Lys Cys Gin Ser Lys Arg Val 
130 135 140 
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. Asa lie - 
145 

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453: 

Val Glu Arg Asn Gin Lys Val Pro Xaa Gly Pro Glu Asn Pro Glu Lys 

15 10 15 

Pro Ser Arg Pro Thr His Pro Xaa Gly Pro Val Asn Pro Asn Asn Pro 

20 25 30 

Gly Leu Ser Xaa Asp Arg Ala Lys Pro Asn Gly Pro Gly Pro Phe Asn 

35 40 45 

Tip lie Lys Met He Lys Val Lys Lys Ser Lys He Ala Lys Glu Ser 

50 55 60 

Val Ala Asn Gin Glu Lys Lys Arg Ala Glu Leu Pro Lys Thr Gly Leu 
65 70 75 80 

Glu Ser Thr Gin Lys Gly Leu He Phe Ser Ser He He Gly lie Ala 

85 90 95 

Gly Leu Met Leu Leu Ala Arg Arg Arg Lys Asn 
100 105 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454: 

Val Thr His Val Asp Val Asp Glu Arg Leu lie Asp Phe Gin He Val 
15 10 15 
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10 



Gly Met Pro Leu Pro Lys Asn Asp Arg Ser Gin Arg Pro Ala Arg Gly ^ 

20 25 30 

Lys Thr lie Gin Ala Lys Thr Arg Gly Lys Ser Leu Asp Lys Ser Lys 

35 40 45 

Ser Asp Asp Lys Gly Arg 
50 

<2) INFORMATION FOR SEQ ID NO: 455: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:455: 

25 

Val Phe His lie Leu Gin Asn lie Gly Met Thr lie Gin Leu Leu Pro 

15 10 15 

lie Thr Gly lie Pro Leu Pro Phe lie Ser Tyr Gly Gly Ser Ala Leu 
30 20 25 30 

Trp Ser Met Met Thr Gly He Gly He Val Leu Ser He Tyr Tyr His 

35 40 45 

Glu Pro Lys Arg Tyr Val Asp Leu Tyr His Pro Lys Ser Asn 
35 50 55 60 

(2) INFORMATION FOR SEQ ID NO: 456: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 1 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

Val Thr Xaa Arg Glu Val Val Asn His He Glu Gin Thr He Xaa Gin 

15 10 15 

55 Tyr Gly Val Asn Glu Met Ser Phe Asp Thr Met Val Leu Phe Gly Asp 

20 25 30 
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His Xaa Ala Ser Pro His Gly Thr Pro Gly^Asp Arg Arg Leu Lys Ser ^ 

35 40 45 

Asn Glu Tyr Val Leu Phe Asp Leu Gly Val lie Tyr Glu His Tyr Cys 

50 55 60 

Ser Asp Met Thr Arg Thr He Lys Phe Trp 

65 70 

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids* 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

Val Gin Val Gly Asp Gly Pro Xaa Gin Arg Glu He Val Phe Pro Asn 

15 10 15 

Ser Thr Asp Xaa Xaa Ser Trp Gly Gly Xaa Thr Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:458: 

Val Gly Pro Asn Thr Glu Ala Glu Phe Arg Lys Glu He Gin Leu Pro 

15 10 15 

Gly Asn Ala Ser Trp Gly Val Gly Pro Asn Thr Glu Xaa He Gly Phe 

20 25 30 

Pro He Ser Ser Asp Asn Ala Ser Trp Gly Val Gly Pro Asn Lys Glu 

35 40 45 

Asn Phe Glu Lys Lys Phe Tyr Arg Gin Cys Glu Leu Gly Cys Gly Pro 
50 55 60 
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Gin 



His Arg Ser Leu Arg 
70 



65 



(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:459: 

Val Thr Arg Leu Phe Tyr Met Phe Asp Lys Glu Thr Met He Ala Ser 

15 10 15 

Met Gly He Gly Gly Gly Leu Gly Asn Ala Ala Leu Phe Thr Arg Phe 



(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

Val Ser Ala He Val Xaa Tyr Xaa Ser Phe Asp Cys Xaa He Arg Lys 

15 10 15 

Phe Lys Leu Lys Thr Pro Gly Arg Glu Asp Glu Glu Thr Glu He Arg 

20 25 30 

Asn Ser Ser Val Ala Lys Leu Pro Xaa Asp Val Leu Asp Xaa Met Gly 

35 40 45 

Trp Lys Arg Lys His 



20 



25 



30 



50 



(2) INFORMATION FOR SEQ ID NO: 461: 



U) SEQUENCE CHARACTERISTICS: 
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10 



(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 



Val Leu Arg Met Thr Glu Gin Pro Ser Tyr Tyr Ser lie lie Thr Ala 
15 1 5 10 15 

Asn Val Arg Tyr Asp Asn Arg Leu Thr Asp Ser Glu Lys Leu Leu Phe 

20 25 30 

Ala Glu lie Thr Ser Leu Ser Asn Lys Tyr Gly Tyr Cys Thr Ala Ser 
20 35 40 45 

Asn Gly Tyr Phe Ala Lys Leu Tyr Glu Val Thr Lys Val Thr Val Ser 

50 55 60 

Arg Arg lie Ala Asn Leu Lys Glu Cys Gly Tyr Leu His Val Glu lie 
25 6 5 7 0 7 5 8 0 

lie Arg Asn Gly Asn Glu He Lys Gin Arg Lys Leu Tyr Pro Leu Thr 

85 90 95 

Glu Met He Arg Pro He Asn Thr Asn Asp Asn Thr Pro He Asn Asn 
30 100 105 110 

Ser Val Asn Thr Pro He He Thr Asn Val Lys Glu Thr He Gin Val 

115 120 125 

Leu He He Gin Val He Thr He 
35 130 135 

(2) INFORMATION FOR SEQ ID NO: 462: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

50 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

Val Gly Phe Xaa Ala Ala He Arg Met Asp Lys Ser Gly Tyr Ser Xaa 
15 10 15 

55 Ser Leu Tyr Glu Gin Asn Thr His He Gly Gly Lys Val Asn Xaa His 

20 25 30 
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Glu Ser Asp Gly Phe Gly Phe Asp Leu Gly. pro Ser lie Leu Thr Met v 

35 40 45 

Pro Tyr lie Cys Glu Lys Leu Phe Glu Tyr Ser Lys Lys Gin Met Ser 

50 55 60 

Asp Tyr Val Thr lie Lys Arg Leu Xaa His Gin Trp Arg Ser Phe Phe 
65 70 75 80 

Pro Asp Gly Thr Thr lie Asp Leu Tyr Glu Gly He Lys Glu Thr Gly 

85 90 95 

Gin His Asn Ala He Leu Ser Lys Gin Asp lie Glu Glu Leu Gin Asn 

100 105 110 

Tyr Leu Asn Tyr Thr Arg Arg He Asp Arg He Thr Glu Lys Gly Tyr 

115 120 125 

Phe Asn Tyr Gly Leu Asp Thr Leu Ser Gin He He Lys Phe His Gly 

130 135 140 - 

Pro Leu Asn Ala Leu lie Asn Tyr Asp Tyr Val His Thr Met Gin Gin 
145 150 155 160 

Ala He Asp Lys Arg He Ser Asn Pro Tyr Leu Arg Gin Met Leu Gly 

165 170 175 

Tyr Phe He Lys Tyr Val Gly Ser Ser Ser Tyr Asp Ala Xaa Ala Val 

180 185 190 

Leu Ser Met Leu Phe His Met Gin Gin Glu Gin Gly Xaa Xaa Tyr Val 

195 200 205 

Glu Gly Gly He His His Xaa Ala Asn Ala Leu Glu Lys Leu Ala Arg 

210 215 220 

Glu Glu Gly Val Thr He His Thr Gly Ala Arg Val Asp Asn He Lys 
225 230 235 240 

Thr Tyr Gin Arg Arg Val Thr Gly Val Arg Leu Asp Thr Gly Glu Phe 

245 250 255 

Val Lys Ala Asp Tyr He He Ser Asn Met Glu Val He Pro Thr Tyr 

260 265 270 

Lys Tyr Leu He His Leu Gly Tyr Ser Thr He Lys Gin He Arg Glu 
275 280 285 

Gly He 
290 

(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 63: 

Val Arg Thr Ser Arg Gin Ser Pro Phe Ala Gin Arg He Glu Gin Gin 

15 10 15 

Gin Lys Arg Leu Asn Leu Pro Asp Leu Pro Thr Thr Thr He Gly Ser 

20 25 30 

Phe Pro Gin Ser Arg Glu Val Arg Lys Tyr Arg Ala Asp Trp Lys Asn 

35 40 45 

Lys Arg He Thr Asp Glu Ala Tyr Glu Thr Phe Leu Lys Asn Glu He 

50 55 60 

Ala Arg Trp He Lys He Gin Glu Asp He Gly Leu Asp Val Leu Val 
65 70 75 80 

His Gly Glu Phe Glu Arg Asn Asp Met Val Glu Phe Phe Gly Glu Lys 

85 90 95 

Leu Gin Gly Phe Leu Val Thr Lys Phe Gly Trp Val Gin Ser Tyr Gly 

100 105 110 

Ser Arg Ala Val Lys Pro Pro He He Tyr Gly Asp Val Lys Trp Thr 

115 120 125 

Ala Pro Leu Thr Val Asp Glu Thr Val Tyr Ala Gin Ser Leu Thr Asp 

130 135 140 

Lys Pro Val Lys Gly Met Leu Thr Gly Pro Val Thr He Leu Asn Trp 
145 150 155 160 

Ser Phe Glu Arg Val Asp Leu Pro Arg Lys Val Ala Gin Asp Gin He 

165 170 175 

Ala Leu Ala He Asn Glu Glu Val Leu Ala Leu Glu Ala Ala Gly He 

180 1B5 190 

Lys Val He Gin Val Asp Glu Pro Ala Leu Arg Glu Gly Leu Pro Leu 

195 200 205 

Arg Ser Glu Tyr His Glu Gin Tyr Leu Lys Asp Ala Gly Phe He lie 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 
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Val Lys Met Met Pro 

1 5 
Asp Leu Glu Pro lie 
20 

Phe Asp lie Gly Leu 
35 

Glu Ala He Val Glu 
50 

Ala Pro Tyr Ser Glu 
65 

Pro Tyr Asn Thr Tyr 
85 

Gin Asp Glu Asn Cys 
100 

Gly He Leu Glu Gly 
115 



Arg Lys Phe Arg Val. Leu 
10 

Phe Gin His Lys Lys Gly 
25 

Phe Glu Phe Asp Ser Gly 
40 

Ala Glu Gly Arg Phe Asp 
55 

Thr Leu Thr Asn Leu Leu 
70 75 
Val Asp Glu Ser Phe Trp 
90 

Pro Lys He Arg Cys Ser 
105 

Thr 



Gin He Gly Gly Asp^. 
15 

Val Ser Trp Asp Tyr 
30 

Tyr Val Glu Ala He 
45 

Phe He Tyr He Gin 
60 

Gin Met lie Ser Glu 
80 

Ser Val Glu Tyr Glu 
95 

Thr Asn Tyr He Thr 
110 



(2) INFORMATION FOR SEQ ID NO: 465: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 



Val Arg Lys Gly Tyr Tyr Pro He Lys Arg Ala He Asp Leu Val Leu 

15 10 15 

Ser He Val Leu Leu Phe Leu Thr Phe Pro He Met Phe He Phe Ala 

20 25 30 

He Ala lie Val He Asp Ser Pro Gly Asn Pro He Tyr Ser Gin val 

35 40 45 

Arg Val Gly Lys Met Gly Lys Leu He Lys He Tyr Lys Leu Arg Ser 

50 55 60 

Met Cys Lys Asn Ala Glu Lys Asn Gly Ala Gin Trp Ala Asp Lys Asp 
65 70 75 80 

Asp Asp Arg He Thr Asn Val Gly Lys Phe He Arg Lys Thr Arg He 

85 90 95 

Asp Glu Leu Pro His Leu He Asn Val Val Lys Gly Glu Met Ser Phe 
100 105 110 
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_Ile Gly Pro Arg Pro Glu Arg Pro 
115 120 
Glu Val lie Gly Phe Glu Gin Arg 

130 135 
Gly Leu Ala Gin He Gin Gly Gly 
145 150 
Leu Lys Tyr Asp Met Lys Tyr He 
165 

Leu Tyr He Ser He Arg Thr Leu 
180 

Ser Arg 



Glu Phe -Val Glu Leu Phe Ser Ser.- 
125 

Cys Leu Val Thr Pro Gly Leu Thr 
140 

Tyr Asp Leu Thr Pro Gin His Lys 
155 160 
His Lys Gly Ser Leu Met Met Glu 

170 175 
Met Val Val He Thr Gly Glu Gly 
185 190 



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466: 

Val Lys Ala Leu Lys Leu Tyr Gly Val Glu Asp Leu Arg Tyr Glu Asp 

15 10 15 

Asn Glu Lys Pro Val He Glu Ser Ala Asn Asp Val He val Lys Val 

20 25 30 

Arg Ala Thr Gly He Cys Gly Ser Asp Thr Ser Arg Asn Xaa Xaa Met 

35 40 45 

Gly Pro Tyr He Lys Gly Met Pro Phe Gly His Glu Phe Ser Gly Val 

50 55 60 

Val Asp Ala He Gly Ser Asp Val Thr His Val Asn Val Gly Asp Lys 
65 70 75 80 

Val Thr Gly Cys Pro Ala He Pro Cys Tyr Gin Cys Glu 
85 90 

(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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.. . (D) TOPOLOGY: linear . - * 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

Val Gly Glu Ala Glu Ser Ser Leu Ser Phe He Gly Glu Leu Asp Asn 

15 10 15 

He Asp Asp Lys Thr Tyr Gin Asp Ala Leu Val He Val Cys Asp Thr 

20 25 30 

Ala Asn Ala Pro Arg He Asp Asp Glu Arg Tyr Ser Thr Gly Ser Lys 

35 40 45 

Leu He Lys He Asp His His Pro Ala Val Asp Gin Tyr Gly Asp He 

50 55 60 

Asn Leu Val Asn Thr Asn Ala Ser Ser Thr Ser Glu He He Tyr Asp 
65 70 75 80 

Leu He Ser His Phe Asn Asp Glu Ala He Val Asn Lys Gly Thr Ser 

85 90 95 

Glu Cys Phe He Pro Trp Tyr Arg Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B ) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 

Val Met Ser Leu Val He Val Leu He Lys Asp Gly Cys Val Ser Lys 

15 10 15 

Phe Ser Leu He Arg Gin Gly Asn Met He Lys Arg Asp Xaa Pro Met 

20 25 30 

He Pro Leu His Gin Thr Glu Glu Glu Glu Phe Tyr Thr Phe He Gly 

35 40 45 

Gin Phe Tyr Ser Leu Asn Gin His He Leu Pro Lys Glu Val His Val 

50 55 60 

Pro Arg Asn Leu Asp Lys Glu Met He Gin Ser Val Val Asp Thr Lys 
65 70 75 80 
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.lie Val Gin Pro Ala Arg Gly Pro Lys Lys.Asp Met Val Asp Leu Alav. 

85 90 95 

Ala His Asn Ala Lys Val Ser Leu Asn Asn Lys Phe Glu Leu He Ser 

100 105 110 

Arg Asp Glu Ser Arg Thr He Lys Ala He Glu Glu Leu Gly Thr Gin 

115 120 125 

Met Gly He Gin Thr Pro He Arg He Glu Ala Phe Asp Asn Ser Asn 

130 135 140 

He Gin Gly Val Asp Pro Val Ser Ala Met val Thr Phe val Asp Gly 
145 150 155 160 

Lys Pro Asp Lys Lys Asn Tyr Arg Lys Tyr Lys He Lys Thr Val Lys 

165 170 175 

Gly Pro Asp Asp Tyr Lys Ser Met Arg Glu Val Val Arg Arg Arg Tyr 

180 185 190 

Ser Arg Val Leu Asn Glu Gly Leu Pro Leu Pro Asp Leu He He Val 

195 200 205 

Asp Gly Gly Lys Gly His Met Asn Gly Val lie Asp Val Leu Gin Asn 

210 215 220 

Glu Leu Gly Leu Asp He Pro Val Ala Gly Leu Gin Lys Asn Asp Lys 
225 230 235 240 

His Gin Thr Ser Glu Leu Leu Leu Trp Arg 
245 250 



(2) INFORMATION FOR SEQ ID NO: 469: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:469: 



Val Val Gin Xaa Thr Arg Phe His 

1 5 
Pro Leu Ser Glu Gly Thr Val Ser 
20 

His Asp Gin Lys He Asp Leu Asn 

35 40 
Glu Gly Cys Val Asp Val Tyr Glu 
50 55 



Asn He Cys Pro His Lys Gin Gly 

10 15 
Gly Glu Tyr Val Phe Cys Pro Leu 
25 30 
Thr Gly lie Val Gin Glu Pro Asp 
45 

Val Glu Val Thr Asp Gly Asn Val 
60 
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_Tyr He Cys Leu . ~ v»» 

65 

(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:470: 

Val His Val Leu Ala Phe Leu Thr Lys His His Ser Glu Lys Phe Asn 

15 10 15 

Ser Ser Ser Leu Ala Glu Leu Thr Cys Leu Xaa Pro Val Gin Leu Arg 

20 25 30 

Arg Val Thr Thr Gin Leu Val Asp Leu Xaa Met He Asp Thr He Arg 

35 40 45 

Gly Lys Asp Gly Gly Tyr Leu Ala Asn Asp Gin Ser Ala Asp Val Ser 

50 55 60 

Leu Ala Thr Leu Tyr Lys His Phe Val Leu Glu Lys Glu Gin His Thr 
65 70 75 80 

Arg Leu Phe Thr Trp Arg Arg Arg Gin Ser Leu Ser Asn Cys Ser 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

Val Gly Phe Leu Asn His Met Leu Thr Leu Phe Thr Phe His Ser Gly 

15 10 15 

Leu Ser Leu Asn He Glu Ala Gin Gly Asp He Asp Val Asp Asp His 
20 25 30 
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His Val .Thr Glu Asp lie Gly He Val lie. Gly Gin Leu Leu Leu Glu,. 

35 40 45 

Met He Lys Asp Lys Lys His Phe Val Arg Tyr Gly Thr Met Tyr He 

50 55 60 

Pro Met Asp Glu Thr Leu Ala Arg Val Val Val Asp He Ser Gly Arg 
65 70 75 80 

Pro Tyr Leu Ser Phe Asn Ala Ser Leu Ser Lys Glu Lys Val Gly Thr 

85 90 95 

Phe Asp Thr Glu Leu Val Glu Glu Phe Phe Arg Ala Val Val He Asn 

100 105 no 

Ala Arg Leu Thr Thr His He Asp Leu He Arg Gly Gly Asn Thr His 

H5 120 125 

His Glu He Glu Ala He Phe Lys Ala Phe Ser Arg Ala Leu Gly He 

130 135 140 

Ala Leu Thr Ala Thr Asp Asp Gin Arg val Pro Ser Ser Lys Gly Val 
145 150 155 160 

He Glu 



(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

Val Asn Thr Ala Met Gly Asn Leu Lys Gin Gly He Ala Asn Lys Asp 

10 is 
Thr Val Lys Ala Ser Glu Asn Tyr His Asp Ala Asp Val Asp Lys Gin 

20 25 30 

Thr Ala Tyr Thr Asn Ala Val Ser Gin Ala Glu Gly He He Asn Gin 

35 40 45 

Thr Thr Asn Pro Thr Leu Asn Pro Asp Asp He Thr Pro Cys He Asn 

50 55 60 

Ser Ser Asp 
65 

(2) INFORMATION FOR SEQ ID NO: 473: 
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. . (i ) SEQUENCE CHARACTERISTICS : . « 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

Val Glu Ala Gin Lys Asp Val Ala Asn Val Leu Glu Asn Val Glu Gin 

15 10 15 

Val Asp Ala Xaa Val Gly Ala Thr Asp Thr lie Ala Leu Ala Ala Tyr 

20 25 30 

Lys Tyr Tyr Ser Asp Lys Asn Asp Val Met Lys Pro His Gin lie Tyr 

35 40 45 

Gly Xaa Gly Gly Asp Pro Met Thr Gin Leu Val Ser Pro Ser lie Lys 

50 55 60 

Thr lie His Tyr Asn Tyr Cys Glu Ala Gly Gin Cys Ala Arg Trp Gin 
65 70 75 80 

Arg Tyr Asn Arg Cys Leu Lys Ser Lys lie Cys His lie Ala Ser Gin 
85 90 95 

Xaa Asp Cys 



(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:474: 

Val Asp His Glu Val Phe Gin Gin Phe Gly Glu Ser Leu Pro Val Tyr 

15 10 15 

Lys Pro Thr Leu Pro Pro Met Val Phe Gly Asn Arg Asp Lys Lys lie 

20 25 30 

Xaa Gly Gly Thr Asp Ala Leu Val Leu Arg Tyr Leu Thr Pro His Gly 
35 40 45 
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Xaa Trp Asn lie His Ser Met Tyr Gin Asp. Asn Lys His Met Leu Thr v ,. 

50 55 60 

Leu Phe Arg Gly Val His Arg Phe Gly Tyr Xaa Met Lys Met Leu Xaa 
65 70 75 80 

Asn Thr He Ser Lys He Met He Gly 
85 

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

Val Tyr Xaa Arg Asn Gly Val Val Thr Ala Arg Ala Val He Ser His 

15 10 15 

Arg Met Pro Lys Gly Thr Met Phe Met Tyr His Ala Gin Asp Lys His 

20 25 30 

He Gin Thr Pro Gly Ser Glu He Thr Asp Thr Arg Gly Gly Ser His 

35 40 45 

Asn Ala Pro Thr Arg He His Leu Lys Pro Thr Gin Leu Val Gly Gly 

50 55 60 

Tyr Ala Gin He Ser Tyr His Phe Asn Tyr Tyr Gly Pro He Gly Asn 
65 70 75 80 

Gin Arg Asp Leu Tyr Val Ala Val Arg Lys Met Lys Glu Val Asn Trp 
85 90 95 

Leu Glu Asp 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 476 : 

Val Thr Trp Gly Thr Asn Pro Glu Met Gly Val Asn Phe Ser Glu Pro 

15 10 15 

Phe Pro Glu lie Asn Asp lie Asn Asp Gin Arg Ala Tyr Asp Tyr Met 

20 25 30 

Gly Leu Glu Pro Gly Gin Lys Ala Glu Asp He Asp Leu Gly Tyr Val 

35 40 45 

Phe Leu Gly Ser Cys Thr Asn Ala Arg Leu Ser Asp Leu He Glu Ala 

50 55 60 

Ser His He Val Lys Gly Asn Lys Val His Pro Asn He Thr Ala He 
65 70 75 . 80 

Val Val Pro Gly Ser Arg Thr Val Lys Lys Glu Ala Glu Lys Leu Gly 

85 90 95 

Leu Asp Thr He Phe Lys Asn Ala Gly Phe Glu Trp Arg Glu Pro Gly 

100 105 110 

Cys Ser Met Cys Leu Gly Met Asn Pro Asp Gin Val Pro Glu Gly Val 

115 120 125 

His Cys Ala Ser Thr Ser Asn Arg Asn Phe Glu Gly Arg Gin Gly Lys 

130 135 140 

Gly Ala Arg Thr His Leu Val Ser Pro Ala Met Ala Ala Ala Ala Ala 
145 150 155 160 

He His Gly Lys Phe Val Asp Val Arg Lys Val Val Val 
165 170 



(2) INFORMATION FOR SEQ ID NO: 477: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



Val Trp Asn Arg His Val Leu Xaa 

1 5 
Leu Tyr He Asp Leu His Leu He 
20 

Phe Glu Gly Leu Arg Leu Gin Asn 
35 40 



Gly Lys Xaa Gly Asp Pro Gin Leu 

10 15 

His Glu Val Thr Ser Pro Gin Ala 

25 30 

Arg Lys Leii Arg Arg Pro Asp Leu 
45 
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. Thr Phe Ala Thr Leu Asp His Asn Val Pro. Thr lie Asp He Phe Asn^ 
50 55 60 

He Lys Asp Glu He Ala Asn Lys Gin He Thr Thr Leu Gin Lys Asn 
65 70 75 80 

Ala He Asp Phe Gly Val His He Phe Asp Met Gly Ser Asp Glu Gin 

85 90 95 

Gly He Val His Met Val Gly Pro Glu Thr Gly Leu Thr Gin Pro Gly 

100 105 110 

Lys Thr He Val Cys Gly Asp Ser His Thr Ala Thr His Gly Ala Phe 

115 120 125 

Gly Ala He Ala Phe Gly He Gly Thr Ser Glu Val Glu His Val Phe 

130 135 140 

Ala Thr Gin Thr Leu Trp Gin Thr Lys Pro Lys Asn Leu Lys He Asp 
145 150 155 160 

He Asn Gly Thr Leu Pro Thr Gly Val Tyr Ala Lys Asp He He Leu 

165 170 175 

Kis Leu He Lys Thr Tyr Gly Val Asp Phe Gly Thr Gly Tyr Ala Leu 

180 185 190 

Glu Phe Thr Gly Glu Thr He Lys Asn Leu Ser Met Asp Gly Arg Met 

195 200 205 

Thr He Cys Asn Met Ala lie Glu Gly Gly Ala Lys Tyr Gly He He 

210 215 220 

Gin Pro Asp Asp He Thr Phe, Glu Tyr Val Lys Gly Arg Pro Phe Ala 
225 230 235 240 

Asp Asn Phe Ala Lys Ser Val Asp Lys Trp Arg Glu Leu Tyr Ser Asp 

245 250 255 

Gly Thr Thr Arg Tyr Leu He Val 
260 

(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 81 amino acids 

{ B ) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

Val He He Asn Lys Val Met Thr Ser Asp Thr Pro Val Thr He Val 
15 10 15 
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Ala Thr Gly Pro Leu Thr Asn Val 
20 

Arg lie Ala Glu His lie Glu Ser 

35 40 
Phe Gly Asn Trp Thr Ala Tyr Ser 

50 55 
Phe Leu Thr Lys Ser Xaa Cys Gly 
65 70 
He 



Ala Thr Ala Leu lie Arg Asp Pro 
25 30 
He Thr Leu Met Gly Gly Gly Thr 
45 

Arg He Leu Ser He Pro Thr Ser 
60 

Phe Val Asn Met Pro Leu Gly Val 
75 80 



(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 



Val Ala His Xaa His Val Val Asn 

1 5 
He Val Xaa Gly Trp Gin Gly Val 
20 

Glu Leu Asp Thr Tyr lie Lys Xaa 

35 40 
Lys His Leu Xaa Leu Phe 
50 



Gly Thr Tyr Tyr Leu His Xaa His 

10 15 
Lys Lys Thr Cys Asp Thr Ala Glu 
25 30 
Ser Asp Val Val Tyr Glu Glu Gin 
45 



(2) INFORMATION FOR SEQ ID NO: 480: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



. (ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 
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.Val Ala His lie His Val Val Asn 

1 5 
lie Val Pro Gly Trp Gin Gly Val 
20 

Glu Leu Glu Thr Tyr lie Lys Gin 
35 40 



Gly Thr- Tyr Tyr Phe His Gly His-." 

10 15 
Lys Lys Thr Phe Asp Thr Ala Glu 
25 30 
Ser Asp Ser Gly lie 
45 



(2) INFORMATION FOR SEQ ID NO: 481: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 



Val Cys 
1 

Asp Arg 
Lys lie 



Thr Tyr Cys 
5 

Glu lie Leu 
20 

Ala Thr Cys 
35 

Gin Arg Leu 



Gly Val Gly Cys 
Lys Val 
Val Lys 



Ser Asp 
50 

His Glu Val Glu Trp 
65 

Thr Ser 



Ser Ser 

Arg Gin 

Xaa Lys 
130 
Gly Asp 
145 



lie Lys Glu 
85 

Lys Ala Thr 
100 

Val He Gly 
115 

His Leu Gin 
Ser Arg 



Thr Lys 
55 

Asp Glu 
70 

Lys Tyr 

Asn Glu 

Thr Asn 

Gin Lys 
135 



Gin Pro 

25 
Gly Lys 
40 

Pro Leu 



Ala Leu 

Gly Pro 

Glu Ser 
105 
Asn Val 
120 

Ala Tyr 



Ser Phe Glu Val Trp Thr Lys 
10 15 
Ser His Asp Ser Pro Ala Asn 
30 

Phe Ser Trp Gly His He Asn 
45 

Val Arg Lys Asn Gly Glu Phe 
60 

Asn Val He Ala Asp Asn Phe 

75 80 
Asp Ala Leu Ser Phe He Ser 
90 95 
xaa Leu Xaa Gin Lys Leu Ala 
110 

Asp Ser Leu Xaa Lys Asp He 
125 

Leu Glu Arg Leu Asp Thr Ala 
140 



(2) INFORMATION FOR SEQ ID NO: 482: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 106 amino acids 
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.. . (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 

Val Thr Lys Ala Val Gly Glu Arg He Pro He Thr lie He Val Ala 

15 10 15 

Val Leu Ala Leu He Val Ala Leu He He Ala He Pro He Gly He 

20 25 30 

He Ser Ala Met Lys Arg Asn Ser Trp Leu Asp He Thr Leu Met He 

35 40 45 

He Ala Leu He Gly Leu Ser He Pro Ser Phe Trp Gin Gly Leu Leu 

50 55 60 

Phe He Leu Ala Xaa Ser Leu Lys Leu Asp He Leu Pro Pro Ser Tyr 
65 70 75 80 

Met Pro Glu His Pro He Ser Val Asp Phe Thr Cys Thr Cys His Trp 

85 90 95 

Asn Lys Tyr Cys Cys Phe Tyr His Ala Tyr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

Val Ala Glu Leu Leu Gly Gly Ser Ala Val Thr Glu Gin Val Phe Asn 

1 * 5 10 15 

He Asn Gly He Gly Arg Tyr He Val Gin Lys Gin Leu He Pro Asp 

20 25 30 

He Pro Ala Val Met Gly Gly Val Val Tyr He Tyr Gin Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 484: 
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w 



15 



25 



(i) SEQUENCE CHARACTERISTICS: 
<A} LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:484: 

Val Thr Kis lie lie lie Asp Gly Asp Ala Cys Pro Val Xaa Asp Ser 

15 10 15 

lie lie Asp Leu Thr Thr Glu Thr Gly lie Phe Val Thr lie lie Arg 

20 25 30 

Ser Phe Ser His Phe Ser Asn Gin Leu Tyr Pro Pro His Val Ser Thr 

35 40 45 

Leu Tyr Val Asp Asp Gly Pro Asp Ala Val Asp Tyr Lys He Val Gin 

50 55 60 

Leu Ser Thr Lys Asp Asp He Val Val Thr Gin Asp Tyr Gly Leu Ala 
65 70 75 80 

Ser Leu Leu Val Asp Lys Val Leu He Val Met His His Asn Gly Lys 

85 90 95 

He Tyr Asn Ser Lys Asn He Gin Gin Leu Leu Asp Lys Arg Tyr Met 

100 105 110 

Asn Ala Gin He Arg Lys Gin Gly Gly Arg His Lys Gly Pro Pro Pro 

115 120 125 

Phe Thr Lys Gin Asp Gin Lys Val Phe Glu Gin Ser Leu Leu Lys Val 

130 135 140 

He His Arg He Lys Glu Leu Asp 
145 150 

40 

(2) INFORMATION FOR SEQ ID NO-.485: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
so (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 



35 
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.Val lie Glu Ser Leu Pro Glu His Val Asp- Thr He He Asp He Lys*. 

15 10 15 

Ser Arg Thr Glu Gly Glu Leu He Thr Lys Glu Lys Glu Leu Val Gin 

20 25 30 

Leu Lys Phe Thr Pro Glu Asn He Asp Asn Val Asp Lys Glu Tyr His 

35 40 45 

Arg Glu Thr Phe Gly Glu Phe Asp Thr Arg Arg Thr Phe Glu Lys Cys 

50 55 60 

Asn Ser 
65 



(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 



Val Asp Ser He He Leu Asp Ala Ala Gin Thr Pro Leu Val He Ser 

15 10 15 

Gly Ala Pro Arg Leu Gin Ser Asn Leu Phe His He Val Lys Glu Phe 

20 25 30 

Val Asp Thr Leu He Glu Asp val His Phe Lys Met Lys Lys Thr Lys 

35 40 45 

Lys Glu He Trp Leu Leu Asn Gin Gly He Glu Ala Ala Gin Ser Tyr 

50 55 60 

Phe Asn val Xaa Asp Leu Tyr Ser Glu Gin Ala Met Val Leu Val Arg 
65 70 75 80 

Asn He Asn Leu Ala Leu Arg Ser Cys Arg Met Phe Leu Gly Ser Tyr 

85 90 95 

Lys Ser Tyr Arg Cys Lys Asn Val Leu Lys Phe Gly Cys Met He Gly 

100 105 110 

Glu Gin Pro Gly Gly 
115 



(2) INFORMATION FOR SEQ ID NO: 487: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 270 amino acids 
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... -(B) TYPE: amino acid ... -t 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 

Val Val Ser Leu Ala Xaa Val Leu lie lie Gly He Gly Ala Leu Thr 

15 10 15 

Asn Ser He Ser Xaa Asn Leu Gly Arg Asn Thr Val Leu Val Leu Gly 

20 25 30 

He Ala Leu Pro Xaa He Tyr Phe Thr Leu He He Arg Ser Xaa Asp 

35 40 45 

Val Xaa Asp Thr Xaa Arg Ser Arg Val Lys Ala Phe He Pro Leu Phe 

50 55 60 

He Leu Gly Met Val Phe Trp Ala He Gin Glu Gin Gly Ser Asn Val 
65 70 75 80 

Leu Asn He Tyr Gly Xaa Xaa His Ser Asp Met Lys Leu Asn Leu Phe 

85 90 95 

Gly Trp Lys Thr Xaa Phe Gly Glu Ala He Phe Gin Ser He Asn Pro 

100 105 110 

Leu Phe He Leu Leu Leu Ala Pro He He Ser Leu Leu Trp Gin Lys 

115 120 125 

Xaa Gly Thr Lys Gin Pro Ser Leu Pro Val Lys Phe Ala He Gly Thr 

130 135 140 

Phe Leu Ala Gly Ala Ser Tyr He Leu He Gly He Val Gly Tyr Ala 
145 150 155 160 

Ser Gly Ser Ser Asn Phe Ser Val Asn Trp Val He Leu Ser Tyr He 

165 170 175 

He Cys Val He Gly Glu Leu Cys Leu Ser Pro Thr Gly Asn Ser Ala 

180 185 190 

Ala Val Lys Leu Ala Pro Lys Ala Phe Asn Ala Gin Met Met Ser He 

195 200 205 

Trp Tyr Leu Thr Asn Ala Ser Ala Gin Ala He Asn Gly Thr Leu Val 

210 215 220 

Lys Leu He Glu Pro Leu Gly Gin Thr Asn Tyr Phe He Phe Leu Gly 
225 230 235 240 

Val Val Ala He He Val Thr Thr He Cys He Ser He Leu Thr Phe 

245 250 255 

Asn His Gin Ser Asp Glu Arg Tyr Thr Leu He Leu Leu Ala 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 488: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 



20 



30 



40 



Val Thr Pro Pro His Pro Asn Gly Val Ser Gin Glu Val Leu Ala Ala 

15 10 15 

Cys Tyr Leu Thr Gin Xaa Thr Gin Val Xaa Xaa Xaa Gly Gly Ala Xaa 

20 25 30 

Ser lie Ala Xaa Leu Thr Tyr Xaa Ala Xaa Thr He Pro Lys Val Asp 

35 40 45 

Lys He Xaa Gly Pro Gly Thr Gin Phe Val Ala Ser Xaa Xaa Lys Xaa 

50 55 60 

Leu Phe Gly Gin Val Ala Tyr Cys Ser His Xaa Arg Xaa Thr Xaa Gin 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 4 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 



Val Phe Gin Val Gly Gly Ala Gin Ser He Ala Ala Leu Thr Tyr Gly 

15 10 15 

Thr Glu Thr He Pro Lys Val Asp Lys He Val Gly Pro Gly Asn Gin 

20 25 30 

Phe Gly Xaa He Cys Pro Lys Asn He Tyr Leu Asp Arg 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 490: 

55 

(i) SEQUENCE CHARACTERISTICS: 



356 



EP 0 841 394 A2 



(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 



Val Gly Met Ser lie Val Leu Ala 

1 5 
Xaa Pro Met Leu Xaa Asn Asn Ser 
20 

His Ser Met Val Asp Ser Met Val 

35 40 
Thr Gin Tyr His Met Gly He Thr 

50 55 
Trp Tyr Phe Lys Arg Arg Thr Lys 
65 70 
Thr He Lys Gin Tyr Val His Ser 
85 



Gly Gly Met Glu Asn Met Ser Gin 

10 15 
Arg Phe Gly Phe Lys Met Gly His 
25 30 
Tyr Asp Gly Leu Thr Asp Val Phe 
45 

Ala Glu Asn Leu Val Gly Ala He 
60 

He His Val Ala Gly Asn Leu Thr 

75 80 
Lys Met Val Asn Leu He Val Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 491: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:491: 



Val He Ser Met Lys Trp Leu Ser Arg He Leu Thr Val He Val Thr 

15 10 15 

Met Ser Met Ala Cys Gly Ala Leu He Phe Asn Arg Arg His Gin Leu 

20 25 30 

Lys Thr Lys Thr Leu Asn Phe Asn His Lys Ala Leu Thr He He He 

35 40 45 

Pro Ala Arg Asn Glu Glu Lys Arg He Gly His Leu Leu His Ser He 

50 55 60 

He Gin Gin Gin Val Pro Val Asp Val He Val Met Asn Asp Gly Ser 
65 70 75 80 
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- Thr Asp Glu Thr Ala Arg Val Ala 
85 

Asp Val Val Asp Asp Thr Asp Gly 
100 

Leu Leu Ser Arg Cys Asp Ala Cys 
115 120 

Arg Cys 



Arg Ser . Tyr Gly Ala Thr Val Val v. . 

90 95 
Lys Trp Tyr Gly Glu lie Thr Cys 
105 HO 
Met Tyr Glu Ser Tyr Cys Leu Cys 
125 



(2) INFORMATION FOR SEQ ID NO: 4 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



40 



45 



55 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 

Val Pro He Leu Gly Thr Val Phe Thr Ala Leu Thr Gly Pro He Gly 

1 5 10 15 

He Val Leu Gly Val Leu Ala Gly Leu Ala Val Ala Phe Thr He Ala 

20 25 30 

Tyr Lys Lys Ser Glu Thr Phe Arg Asn Cys Val Asn Gly Ala He Asn 

35 40 45 

Ser Val Lys Gin Thr Phe Ser Asn Phe He Gin Phe He Gin Pro Tyr 

50 55 60 

He Asp Ser Val Lys Asn Val Phe Lys Gin Ala Val Ser Ala He Gly 
65 70 75 80 

Asp Phe Ala Lys Asp He Trp Ser Gin He Asn Gly Phe Phe Asn Glu 

85 90 95 

Asn Gly He Ser He Ala Gin Ala Leu Gin Asn He Cys Asn Phe He 

100 105 HO 

Lys Ala He Phe Glu Phe He Leu Lys Phe Cys Asn 
115 120 

(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
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15 



.(D) TOPOLOGY: linear _ , 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

Val Lys Ser Leu Lys Thr Val lie Gly Met Asn Asn Lys Glu His lie 

15 10 15 

Lys Ser Val lie Leu Ala Leu Leu Val Leu Met Ser Val Val Leu Thr 

20 25 30 

Tyr Met Val Trp Asn Phe Ser Pro Asp lie Ala Asn Val Asp Asn Thr 

35 40 45 

Asp Ser Lys Lys Ser Glu Thr Lys Pro Leu Thr Thr Pro Met Thr Ala 
50 55 60 

2Q Lys Met Asp Thr Thr lie Thr Pro Phe Gin lie lie His Ser Lys Asn 

65 70 75 80 

Asp His Pro Glu Gly Thr lie Ala Thr Val Ser Asn Val Asn Lys Leu 
85 90 95 

2s Thr Lys Pro Leu Lys Asn Lys Glu Val Xaa Ser Val Glu His Val Arg 

100 105 110 

Arg Asp His Asn Leu Met lie Pro Asp Leu Ser Ser Asp Phe Thr Leu 
115 120 125 

30 Phe Asp Phe Thr Phe Asp Leu Pro Leu Ser Thr Tyr Leu Gly Gin Val 

130 135 140 

Xaa Glu His 
145 

35 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 

50 

Val He Leu Pro Lys Leu Val Thr Tyr Gin Val Gin Pro Met Ser Met 
15 10 15 

ss Met Leu He Pro Lys Gly Xaa Xaa Glu Leu Ala Leu Arg Phe Asn Gin 

20 25 30 
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Gly lie Asn Xaa Gly Gly Asp Ala 

35 40 
lie Ala Gly Ala Phe Asn Pro Asp 

50 55 
Xaa Arg Leu Glu Xaa Lys Asp Xaa 
65 70 
Thr Thr Arg val Gin Gin Arg Glu 
85 



Leu Lys -Lys Arg Thr His Phe Asn v* 
45 

Val Arg Lys Leu Asp Gly Ala Val 
60 

Lys Arg Asn Val Leu Phe Xaa Asn 
75 80 

Asn His 
90 



(2) INFORMATION FOR SEQ ID NO: 495: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 6 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:495: 



Val Leu Val Gin Pro His lie Val lie Lys Pro Glu Ala Gin Gin Ala 

15 10 15 

lie Lys Ala Thr Ala Glu Asn Gin Val Glu Ser lie Lys Asp Thr Pro 

20 25 30 

His Ala Thr Val Asp Glu Leu Asp Glu Ala Asn Gin Leu He Ser Asp 

35 40 45 

Thr Leu Lys Gin Ala Gin Gin Glu He Glu Asn Thr Asn Gin Asp Ala 

50 55 60 

Ala Val Thr Asp Val Arg Asn Gin Thr He Lys Ala He Glu Gin lie 
65 70 75 80 

Lys Pro Lys Val Arg Arg Lys Arg Ala Ala Leu Asp Ser He Glu Glu 

85 90 95 

Asn Xaa Lys Asn Gin Leu Gly Cys Asn Pro Glu lie Arg Gly lie Leu 

100 105 110 

Leu Lys Asp Glu Arg Asp Cys Trp Leu Leu Asp Thr Phe Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 496: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



360 



EP 0 841 394 A2 



.(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

Val Ser Leu Leu Pro Leu lie Ser Asp Asn Met Tyr Leu Gly Tyr He 

1 5 10 15 

Ala Gly Leu Leu Ala Ala Tyr Ala Gly Gly Phe Val Cys Thr Tyr Leu 

20 25 30 

Phe Gly Pro Thr Lys Ala Met Arg Gin Asp Arg Phe Val Gly 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:497: 

Val Gly Xaa Xaa He Met Lys Phe Lys Ala He Ala Lys Xaa Ser Leu 

1 5 10 15 

Ala Leu Gly Met Leu Ala Thr Gly Val He Thr Ser Asn Val Gin Ser 

20 25 30 

Val Gin Ala Xaa Thr Glu Val Xaa Gin Gin Ser Glu Ser Glu Leu Xaa 

35 40 45 

His Tyr Tyr Asn Lys Pro Val Leu Glu Arg Lys Asn Val Thr Gly Tyr 

50 55 60 

Xaa Tyr Thr Glu Lys Gly Lys Asp Tyr He Asp Val lie Val Asp Asn 
65 70 75 80 

Gin Tyr Ser Gin He Ser Leu Val Arg He 
85 90 

(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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-(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 

Val lie Asn lie Glu Leu Val Leu His Gin Arg Xaa Ala Asn Pro Val 

1 5 10 15 

Arg He Ala Asn He Ser Asn Asn Ala Thr Val Ser Gin Ala Asp Gin 

20 25 30 

Ala Thr He He Asn Ser Leu Thr Phe Thr Ser Asn Ala Pro Asn Arg 

35 40 45 

Asn Tyr Ala Thr Ala Ser Ala Asn Glu He Thr Ser Lys Thr Val Ser 

50 55 60 

Asn Val Ser Arg Thr Gly Asn Asn Ala Asn Val Thr Val Thr Val Thr 
65 70 75 80 

His Gin Asp Gly Thr Thr Ser Thr Val Thr Val Pro Val Lys His Val 

85 90 95 

He Pro Glu He Val Ala His Ser His Tyr Thr Val Gin Gly Gin Asp 

100 105 HO 

Phe Pro Ala Gly Asn Gly Ser Ser Ala Ala Asp Tyr Phe Lys Leu Ser 

115 120 125 

Asn Gly Ser Ala He Pro Asp Ala Thr He Thr Trp Val Ser Gly Gin 

130 135 140 

Xaa Pro Lys 
145 

(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 

Val Met Glu Asn Ser Arg Pro Glu Arg Asn Glu Ala Thr Met His Leu 

1 5 10 15 

Asp Glu Met Thr Val Glu Glu Ala Leu He Thr Met Asn Lys Glu Asp 
20 25 30 
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. Gin Gin val Pro Leu Ala Val Arg Lys Ala. lie Pro Gin Leu Thr Lys , 
35 40 45 

Val He Lys Lys Thr He Ala Gin Tyr Lys Lys Gly Gly Arg Leu He 

50 55 60 

Tyr He Gly Ala Gly Thr Ser Gly Arg Leu Gly Val Leu Asp Ala Ala 
65 70 75 80 

Glu Cys Val Pro Thr Phe Asn Thr Asp Pro His Glu He He Gly He 

85 90 95 

He Ala Gly Gly Gin His Ala Met Thr Met Ala Val Glu Gly Ala Glu 

100 105 no 

Asp His Lys Lys Leu Ala Glu Glu Asp Leu Lys Asn He Asp Leu Thr 

115 120 125 

Ser Lys Asp Val Val He Gly He Ala Ala Ser Gly Lys Thr Pro Tyr 

130 135 140 

Val He Gly Gly Leu Thr Phe Ala Asn Thr He Gly Ala Thr Thr Val 
1*5 150 155 160 

Ser He Ser Cys Asn Glu His Ala Val He Ser Glu He Ala Gin Tyr 

165 170 175 

Pro Val Glu Val Lys Val Gly Pro Glu Val Leu Thr Gly Ser Xaa Arg 

180 185 190 

Leu Lys Ser Gly Thr Ala Gin Lys Leu He Leu Asn Met He Ser Thr 

195 200 205 

He Thr Met Val Gly Val Gly Lys Val Tyr Asp Asn Leu Met He Asp 

210 215 220 

Val Lys Ala Thr Asn Gin Lys Leu He Asp Arg Ser Val Arg He He 
225 230 235 240 

Gin Glu He Cys Ala He Thr Tyr Asp Glu Ala Met Ala Leu Tyr Gin 

245 250 255 

Val Ser Glu His Asp Val Glu Ser Cys Asp Ser Tyr Gly Tyr Val Trp 
260 265 270 

His Phe 



(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 
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Val Xaa Xaa He His Ser Gin Gin He Val He Ala Pro Asp Ser Phe 

15 10 15 

Lys Glu Ser Met Xaa Xaa His Gin Val Xaa Asn Ile He Lys Gin Ala 

20 25 3 0 

Phe Thr Asn Val Tyr Gly Asn Thr Leu His Tyr Asp He He Pro Met 
35 40 45 

io Pr ° ASP Gly Gly Glu Gly Thr Xaa Asp Xaa Leu Ile His Ala Xaa Xaa 

50 55 60 

Ala Thr Lys Tyr Thr Val He Val Asn Asp Pro Leu Met Arg Pro Ile 

65 70 75 80 

is Glu Ala Cys TV** Ala ^5 Ala As P Glu Gin Gin Ile Ala Ile Ile Glu 

85 90 95 

Met Ala Ala Ala Ser Gly Leu Asp Leu Leu Glu Lys Glu Glu Arg Asn 

100 105 110 

20 Pro Leu Thr Ser Ser Tyr Gly Thr Gly Glu Leu lie Lys Asp Ala 

115 120 125 

Leu Asn His Gly Ala Lys Thr He Ile Leu Gly Ile Gly Gly Ser Ala 

130 135 140 

2S Thr Asn Asp Gly Gly Thr Gly Met Leu Ser Ala Leu Gly Val Lys Phe 

145 150 155 160 

Thr Asp Val Asn Gly Asp Leu Leu Gin Met Asn Gly Ala Asn Leu Ala 

165 170 175 

30 His Ile Ala Gin Ile Asp Ile Thr Asn Leu Ala Asp Ser Arg Xaa Lys 

180 185 190 

Glu Val Thr Phe Lys Val Ala Cys Asp Val Ser Asn Pro Leu Leu Gly 

195 200 205 

35 Glu Asn Gly Ala Thr Tyr He Tyr Gly Pro Gin Lys Cys Ala Asp Ala 

210 215 220 

Lys Met His Thr 

225 

40 

(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 117 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

so 

(ii) MOLECULE TYPE: Protein 

ss (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
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20 



25 



35 



40 



45 



50 



Val Lys Gin Cys lie Asn Asn Asp Glu Trp.Phe Val Thr Asn Asp Asn 

15 10 15 

Gly Tyr val Lys Glu Gin Tyr Leu Tyr xaa Ala Gly Arg Gin Gin Asp 

20 25 30 

Met Leu lie He Gly Gly Arg Asn He Tyr Pro Ala His Val Xaa Arg 

35 40 45 

Leu Leu Thr Gin Ser Ser Ser He Asp Glu Ala He He He Gly He 

50 55 60 

Pro His Glu Arg Phe Gly Xaa He Gly Val Leu Leu Tyr Ser Gly Asp 
65 70 75 80 

Val Thr Leu Thr His Lys Asn Xaa Lys Gin Phe Xaa Lys Lys Lys Val 

85 90 95 

Lys Arg His Glu He Pro Phe Asp Asp Ser Ser Cys Arg Lys Asp Val 

100 105 110 

Leu Xaa Cys Lys Trp 
115 

(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 



Val Ser Ala He Phe Glu Pro Glu His Leu Glu Ala Leu Leu Ala Gin 

15 10 15 

Val Ser He Lys Pro Val He Asn Gin Val Glu Tyr His Pro Tyr Leu 

20 25 30 

Thr Gin His Lys Leu Lys Leu Tyr Leu Ala Ala Gin His He Val Met 

35 40 45 

Glu Ser Trp Ser Pro Leu Met Asn Ala Gin He Leu Asn Asp Glu Thr 

50 55 60 

He Lys Asp He Ala Gin Glu Leu Gly Lys Ser Pro Ala Gin Val Val 
65 70 75 80 

Leu Arg Trp Asn Val Gin His Gly Val Val He He Pro Lys Ser Val 

85 90 95 

Thr Pro Asn Arg He Ser Glu Asn Phe Gin He Phe Asp Phe Glu Leu 
55 100 105 110 
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.Ser Asp Glu Gin Met Thr Leu Val Pro Gly. Leu Asn Leu Asp Lys Arg^. 

115 120 125 

He Gly Pro Asp Xaa Xaa Thr Phe Glu Gly 
130 135 



(2) INFORMATION FOR SEQ ID NO: 503: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 



Val Phe Lys Gly Ser Glu Tyr Gin 

1 5 
Val Lys Tyr Glu Gin Thr Tyr Asp 
20 

Lys Ala Met Leu Asn Phe Asn He 

35 40 
Lys Gin Ser Met Met Asp Asp He 

50 55 
Lys His Gin Val He Gly Val Arg 
65 70 



Leu Ser Glu He Asn Ser Gly Ser 

10 15 
Asn Phe Pro He Leu Asn Asn Ser 
25 30 
Glu Asp Asn Lys Ala Ala Ser Tyr 
45 

Lys Pro Thr Asp Gly Ala Asp Lys 
60 

Lys Ala He Glu Ala Leu 
75 



(2) INFORMATION FOR SEQ ID NO: 504: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 

Val Asn Glu Thr Asn Val Gin Leu Leu Gin Pro Asn Trp Glu lie Lys 

15 10 15 

Val Lys His Asp Gly Lys Asp Lys Thr Asn Thr Tyr Tyr Val Glu Ala 
20 25 30 
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Thr Asn Asn Asn Pro Lys He He Asn His... v 
35 40 

(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

Val Leu He Tyr Phe He Glu Gly Leu Met Ser Xaa Asn Gin Pro Ala 

15 10 15 

Glu Ala Leu Glu Xaa Leu Ser Tyr Val Asp Pro Ser Pro Xaa Xaa Leu 

20 25 30 

Met Xaa Lys Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

Val Gly He Asn Ser Lys Gin Gly Lys Asn Leu He Gly Ala Phe Tyr 

15 10 15 

Arg Pro Thr Ala Val lie Tyr Asp Leu Asp Phe Leu Lys Thr Leu Pro 

20 25 30 

Phe Glu Gin He Leu Ser Gly Tyr Ala Glu Val Tyr Lys His Ala Leu 

35 40 45 

Leu Asn Gly Glu Ser Thr Thr Gin Glu He Glu Gin His Phe Lys Asp 

50 55 60 

Arg Glu He Leu Gin Pro Leu Asn Gly He Gly 
65 70 75 
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(2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 

Val Val Thr Leu Phe Leu Cys Pro Xaa Asn Ser Phe Leu Met lie His 

15 10 15 

Asn Ser Trp Val Met Thr Val Gly Asn Ala Glu Glu Leu Arg Lys Thr 

20 25 30 

Ala Asp Leu Leu Glu Lys Thr Asp Ala Val Ser Asn Ser Ala Tyr Leu 

35 40 45 

Asp Lys Xaa Xaa Asp Leu Asp Gin Glu His Leu Lys Gin Met Leu Asp 

50 55 60 

Ala Glu Thr Trp Leu Thr Ala Glu Glu Ala Leu Ser Phe Gly Leu lie 
65 70 75 80 

Asp Glu lie Leu Gly Ala Asn Glu lie Ala Ala Ser lie Ser Lys Glu 

85 90 95 

Gin Cys Arg Arg Xaa Glu Xaa Val Pro Glu Asp Leu Lys Lys Asp Val 

100 105 110 

Xaa Lys lie Thr Lys lie Asp Asp Xaa Arg Tyr Asp Leu Asp Trp Leu 

115 120 125 

Lys Pro Pro Lys Glu Ser Met Ser Leu Glu Glu Xaa Xaa Xaa Arg Xaa 

130 135 140 

Xaa lie Val Arg Ser Asn Ala Lys Phe 
145 150 

(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: % 

Val Lys Glu lie Pro Asp Ala Ser lie Ser Phe Glu Val Phe Ala Asp 

1 5 10 15 

Asp Leu Glu Thr Met Glu Lys Glu Ala Ala lie Leu Lys Gin Tyr Gly 

20 25 30 

Glu Asn Val Phe Val Lys lie Pro lie Val Asn Thr Lys Gly Glu Ser 

35 40 45 

Thr lie Pro Leu He Lys Lys Leu Ser Ala Asp Asn Val Arg Leu Asn 

50 55 60 

Val Xaa Ala Val Tyr Thr He Glu Gin Gly Lys Glu He Thr Glu Ala 
65 70 75 80 

Val Thr Glu Gly Val Pro Asn He Cys Phe Ser He Cys Arg Thr Tyr 

85 90 95 

Cys Arg Tyr Arg Arg Arg Ser He Thr He Asn Glu Arg Gly Cys Lys 
100 105 110 

Ser Tyr Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

Val Tyr Xaa Arg Val Asn Glu Met Asn Ala Lys Glu Xaa Leu Val Asp 

1 5 10 15 

Xaa Leu Met Lys Thr Ser Ser Gin Leu Phe Lys Xaa His Gly Glu Val 

20 25 30 

Xaa Met Gin Leu Xaa Leu Asn Asp Glu Leu Lys Leu Pro Ser He Xaa 

35 40 45 

Glu He Cys Val Glu Arg Lys Arg Leu Ser Asp He Val Lys Val He 

50 55 60 

Pro Gin Ser Tyr Ala Leu Leu Tyr He Asp Lys Gin Asp Gin Ala Arg 
65 70 75 80 

Ala Lys Xaa Xaa Leu Ser Leu Xaa Lys He Ala Lys Val Tyr Val Gin 
. 85 90 95 
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Tyr Asp 



Asp 



Thr 



100 



Thr 



He 



Met Ser 



He Phe 
105 



yal Tyr Asp Val Val 
110 



Asn 



Asp Glu Trp He Leu Arg Leu Asp Pro Asn He Arg He Pro Lys Ser 

115 120 125 

Asn He Tyr Phe His Ser Leu Asn Trp Asp Val Asp Tyr He Lys Pro 

130 135 140 

Glu He Val Leu Met Tyr Asp Leu Met Gin His His Gin Tyr His His 
145 150 155 160 

Tyr Ser Asn Tyr Lys Arg Val He Asp Xaa Leu Ser Tyr Tyr Gin Phe 

165 170 175 

Phe He Leu Lys Phe Val Val Gly Glu Xaa Arg He Lys Asp Ala He 

180 185 190 

Gin Lys Asn Asn Lys 
195 

(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 

Val Pro Pro Gin Gin Gin Ala Pro Thr Lys Gin Arg Pro Ala Lys Xaa 

15 10 15 

Glu Asn Asp Asp Lys Ala Ser Xaa Asp Glu Ser Lys Asp Lys Asp Asp 

20 25 30 

Asn Ala Ser Gin Asp Xaa Ser Asp Asp Thr Gin Lys Lys Thr Asp Asp 

35 40 45 

Asn Thr His Pro Ala Ala Arg Xaa 
50 55 

(2) INFORMATION FOR SEQ ID NO: 511: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein ... ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

Val Leu Met Leu Lys Met Xaa Leu He Lys Lys Leu Thr Gin Met Glu 

15 10 15 

Asp Leu Val Asn Gin Asn Asp Xaa Leu Thr Asp Glu Glu Lys Gin Xaa 

20 25 30 

Ala He Gin Val He Glu Glu His Lys Asn Glu He He Gly Asn He 

35 40 45 

Gly Asp Gin Thr Thr Asp Asp Gly Val Thr Arg He Thr Arg Ser Arg 

50 55 60 

Tyr Thr Asp Leu Lys Trp Gly Tyr Cys Asn Thr Gly Cys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

Val Lys Trp He Lys Arg Leu Leu Lys Asp Leu Lys Lys Leu Ser Met 

15 10 15 

Leu Phe Lys Asp He Leu Gin Leu Val Pro Lys Gin Leu Phe Gly Gly 

20 25 30 

Gly Asp Ala Glu Leu Gin Leu Thr Glu Ala Asp Tyr Leu Thr Ala Gly 

35 40 45 

He Asp He Val Val Gin Pro Pro Gly Lys Lys Leu Gin His Leu Ser 

50 55 60 

Leu Leu Ser Gly Gly Glu Arg Ala Leu Thr Ala He Ala Leu Leu Phe 
65 70 75 80 

Ala He Leu Lys Val Arg Ser Ala Pro Phe Val He Leu Asp Glu Val 

85 90 95 

Glu Ala Ala Leu Asp Glu Ala Asn Val He Arg Tyr Ala Lys Tyr Leu 

100 105 110 

Asn Glu Leu Ser Asp Glu Thr Gin Phe He Val He Thr His Arg Lys 
115 . . 120 125 
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Gly Thr Met Glu Phe Ala Asp Arg Leu Tyr. Gly Val Thr Met Gin Glu v „ 

130 135 140 

Ser Gly Val Thr Lys Leu Val Ser Val Asn Leu Asn Thr lie Asp Asp 
145 150 155 160 

Val Leu Lys Glu Glu Gin 
165 

(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

Val Arg Asp Gin Phe Lys Phe Tyr His Val Arg His Glu Glu Val Ala 

15 10 15 

Ser Leu Ala Ala Ala Gly Tyr Thr Lys Leu Thr Gly Xaa lie Gly Val 

20 25 30 

Ala Leu Ser lie Gly Xaa Pro Gly Leu lie His Leu Leu Asn Gly Met 

35 40 45 

Tyr Asp Ala Lys Met Asp Asn Val Leu Val Pro lie Asn lie lie Trp 

50 55 60 

Thr Asn Xaa lie Val Gin His Leu Glu Arg Lys His Phe Lys Lys Gin 
65 70 75 80 

lie Tyr Lys Asn Tyr Val Lys Met 
85 

(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 153 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
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Val Lys Leu Ser Xaa Tyr Tyr Pro Gin Gly. Leu Arg Ser Leu Asn Gly v . 

15 10 15 

Gly Arg Met Ala Arg Phe Gly Arg Thr Pro Leu Leu Asp Ala Met Glu 

20 25 30 

Met Ala Asn Glu His He Met Val He Ala Met He Glu Asp Val Xaa 

35 40 45 

Gly Val Met Ala He Asp Asp He Ala Gin Val Glu Gly Leu Asp Met 

50 55 60 

He Val Glu Gly Ala Ala Gly Phe He Ala Val Thr Trp His Thr Xaa 
65 70 75 80 

Ala Asn Glu Arg Asp Asp Gin Val Thr Ser His Xaa Gin His He Xaa 

85 90 95 

Xaa Val Val Asn Ala His Gly Lys His Xaa Cys Ala Leu Pro Arg Glu 

100 105 HO 

Asp Glu Asp He Ala Lys Trp Gin Ala Gin Gly Val Gin Thr Phe He 

115 120 125 

Leu Gly Thr Ser Gly Lys He Tyr Arg His Leu Ser Ala Ser Leu Ala 

130 135 140 

Thr Ser Lys Gin Lys Gly Asp Asp Gly 
145 150 



(2) INFORMATION FOR SEQ ID NO: 515: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

Val Leu Thr Ser Leu Asp Ala Asp Glu Ala Lys Arg Asn Ala Tyr Thr 

15 10 15 

Asn Ala Val Thr Gin Ala Xaa Gin He Leu Asn Lys Ala Gin Gly Pro 

20 25 30 

Asn Thr Ala Lys Asp Gly Val Glu Thr Ala Leu Gin Asn Val Gin Arg 

35 40 45 

Ala Lys Asn Glu Leu Ser Gly Asn Gin Asn Val Ala Asn Ala Lys Thr 

50 55 60 

Thr Ala Lys Asn Ala Leu Asn Asn Leu Thr Ser He Asn Asn Ala His 
65 70 75 80 
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.Lys Ala Ala Leu Lys Ser Gin lie Glu Gly.Ala Thr Thr Val Ala Gly^. 

85 90 95 

Val Asn Gin Val Ser Thr Met Ala Ser 
100 105 



(2) INFORMATION FOR SEQ ID NO: 516: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 



Val Asn Thr Ala Lys Thr 

1 5 
Ala Lys Asn Thr Ala Lys 
20 

Asn Ala Gin Lys Ala Asn 
35 

Val Ala Gly Val Gin Gly 
50 

Ala Met Asn Gin Leu Arg 
65 70 
Ser Ser Glu Asp Tyr Gin 
85 

Asn Asp Ala Val Thr Asn 
100 

Pro Glu Met Asn Pro Asp 
115 

Ser Ala Lys Ser Ala Leu 
130 

Gin Thr Ala Lys Ser Asp 
145 150 
Gin Arg Thr Ala Xaa Asn 
165 

Ala Val Thr Ala Ala Lys 
180 

Gly Asn Val Lys His Ala 
195 



Ala Arg Leu Asn Glu 
15 

Met Ser His Leu Thr 
30 

Glu Arg Gly Thr Thr 
45 

Gly Thr Leu Asn Gin 
60 

Lys Asp Ala Thr Lys 
80 

Leu Gin Asn Ala Tyr 
95 

Ser Ala Xaa Asn Asn 
110 

Ala Ser Gin Val Asn 
125 

Leu Ala Ala Ala Lys 
140 

Asp Leu Asn Asn Ala 
160 

Ala Pro Xaa Leu Ala 
175 

Leu Asn Thr Ala Met 
190 

Asn Thr Xaa Arg Ser 
205 



Ala Leu Asn Gly Asp 
10 

Gin Gin Leu Ala Thr 
25 

Leu Thr Glu Gin lie 
40 

lie Gin Ala Asn Ala 
55 

Gin Ser lie Ala Ser 
75 

Asp Ala Asn Ala Asp 
90 

Ala Gly Gly He He 
105 

Thr He Xaa Gin Lys 
120 

Xaa Gly Asp Glu Lys 
135 

He Gly Arg Val Thr 
155 

Ala Glu Val Asp Gin 
170 

Asn Lys Ala Thr Ser 
185 

Leu Ala Glu Lys Asp 
200 
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Val Asn Tyr Thr Asp Ala Asp Gin 

210 215 
Ala Gly Thx Gin Ala Glu Ala lie 
225 230 
Asn Glu Thr Gin Val Gin Ala Ala 
245 

Asn Asp Leu Glu Trp Val He He 
260 

Gin Asn Val His 
275 



Pro Xaa. Gin Gin Ala Xaa Asp Thr^ 
220 

Thr Asn Ala Asn Gly Ser Xaa Ala 
235 240 
Leu Asn Gin Leu Asn Gin Ala Lys 

250 255 
Lys Leu Leu Lys Arg Lys Lys Gin 
265 270 



(2) INFORMATION FOR SEQ ID NO: 517: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 



Val Asn Thr Thr Lys Ala Ala Leu His Gly Asp val Lys Leu Gin Asn 

15 10 15 

Asp Lys Asp His Ala Lys Gin Thr Val Ser Gin Leu Ala His Leu Asn 

20 25 30 

Asn Ala Gin Lys His Met Glu Asp Thr Leu He Asp Ser Glu Thr Thr 

35 40 45 

Arg Thr Ala Val Lys Gin Asp Leu Thr Glu Val Gin Ala Leu Asp Gin 

50 55 60 

Leu Met Asp Ala Leu Gin Gin Ser He Ala Asp Lys Asp Ala Thr Arg 
65 70 75 80 

Ala Ser Ser Ala Tyr Val Asn Ala Glu Pro Asn Lys Lys Gin Ala Tyr 

85 90 95 

Asp Glu Ala Val Gin Asn Ala Glu Ser He He Ala Gly Leu Asn Asn 

100 105 110 

Pro Thr He Asn Lys Gly Asn Val Ser Ser Ala Thr Gin Ala Val He 

115 120 125 

Ser Ser Lys Asn Ala Leu Asp Gly Val Glu Arg Leu Ala Gin Asp Lys 

130 135 140 

Gin Thr Ala Gly Asn Ser Leu Asn His Leu Asp Gin Leu Thr Pro Ala 
145 150 155 160 
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Gin Gin Gin Ala Leu Glu Asn Gin lie Asn. Asn Ala Thr Thr Arg Aspv. 

165 170 175 

Lys Val Ala Glu He He Ala Gin Ala Gin Ala Leu Asn Glu Ala Met 

180 185 190 

Lys Ala Leu Lys Xaa Ser lie Lys Asp Gin Pro Gin Thr Glu Ala Ser 

195 200 205 

Ser Lys Phe He Asn Glu Asp Gin Ala Gin Lys Asp Ala Tyr Thr Gin 

210 215 220 

Ala val Gin His Ala Arg Arg Cys Leu Xaa 
225 230 

(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

Val Ala Glu He He Ala Gin Ala Gin Ala Leu Asn Glu Ala Met Lys 

15 10 15 

Ala Leu Lys Glu Ser He Lys Asp Gin Pro Gin Thr Glu Ala Ser Ser 

20 25 30 

Lys Phe He Asn Glu Asp Gin Ala Gin Lys Asp Ala Tyr Thr Gin Ala 

35 40 45 

Val Gin His Glu Arg Lys He 
50 55 

(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 
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Val Val Val Leu Arg Met Ala Ser Asn Met. Pro Ser Leu Xaa Lys Val v , t 

15 10 15 

Glu Asn Val Glu Met lie Ala Phe Cys Asp Val Asp lie Ser Lys Ala 

20 25 30 

Ala Ser Ala Ala Glu Ala Tyr Gly Thr Asp Asn Ala Lys Val Tyr Asp 

35 40 45 

Asp Tyr Lys Ala Leu Leu Lys Asp Asp Thr lie Asp Val He His Val 

50 55 60 

Cys Thr Pro Asn Asp Ser His Cys Glu Tie Thr Val Ala Gly Leu His 
65 70 75 80 

Ala Gly Lys His Val Met Cys Glu Lys Pro Met Ala Lys Thr Thr Ala 

85 90 95 

Glu Ala Gin Lys Met He Asp Thr Ala Lys Ser Thr Gly Lys Lys Leu 

100 105 110 

Thr He Gly Tyr Gin Asn Arg Phe Arg Pro Asp Ser Gin Phe Leu His 

115 120 125 

Gin Ala Ala Gin Arg Gly Asp Leu Gly Asp He Tyr Phe Gly Lys Ala 

130 135 140 

His Ala lie Arg Arg Arg Ala Val Pro Thr Trp Gly Val Phe Leu Asn 
145 150 155 160 

Glu Glu Ala Gin Gly Gly Gly Pro Leu He Asp He Gly Thr His Ala 

165 170 175 

Leu Asp Leu Thr Leu Trp Met Met Asp Asn Tyr Glu Pro Glu Ser Val 

180 185 190 

Met Gly Ser Thr Phe His Lys Leu Asn Lys Gin His Asp Ala Pro Asn 

195 200 205 

Ala Trp Gly Ser Trp Asn Pro Asp Glu Leu Thr Val Glu Asp Ser Ala 

210 215 220 

Phe Gly Phe He Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 520: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO; 520: 
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.Val Asp Glu Ala Lys Cys Ser Leu 

1 5 
Met Lys Asp Val Leu Arg lie His 
20 

Thr Lys His Val Glu Xaa Glu Asn 

35 40 
Asn Glu Val Asp Glu Ala Glu Glu 

50 55 
Val Val Asn Asp Thr Glu Pro Val 
65 70 
Val Thr Lys lie Leu Glu Ala He 
85 

Ala He Tyr Phe Glu 
100 



Leu Gly. Thr Lys Ala Gly Ala Aspv- 

10 15 
Gly Glu Asp Met Gly Thr Leu Tyr 
25 30 
Lys Gly Val Asp Phe Tyr Glu Gly 
45 

Glu Xaa Lys Ala Trp He Asp Ala 
60 

Val Lys Pro Glu Gin Ala Met Val 

75 80 
Tyr Gin Ser Ala Lys Ser Gly Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 521: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 



Val Asp Pro Pro Gly Cys Arg Asn Ser Thr Arg Thr Arg Ala Ala Phe 

15 10 15 

Thr Val Ala Ser He Asp Leu Gly Ala His Pro Glu Phe Leu Gly Lys 

20 25 30 

Asn Asp He Gin Leu Xaa Lys Lys Glu Ser Val Glu Asp Thr Xaa Lys 

35 40 45 

Val Leu Gly Arg Met Phe Asp Gly He Glu Phe Lys Leu 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 522 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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. .. . (ii) MOLECULE TYPE: Protein ... * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Val lie Pro Asn Glu Phe Leu Lys His Ser Gly Lys Val His Ala Gin 

15 10 15 

Ala Phe Phe Thr Gin Asn Gly Ser Asn Asn Val Val Val Glu Arg Gin 

20 25 30 

Phe Ser Phe Asn lie Glu Asn Asp Leu Val Ser Gly Xaa Asp Gly lie 

35 40 45 

Thr Lys Leu Val Tyr He Lys Ser He Gin Asp Thr He Glu Ala Val 

50 55 60 

Gly Lys Asp Phe Asn Gin Leu Lys Gin Asn Met Ala Asp Thr Gin Thr 
65 70 75 80 

Leu He Ala Lys Val Asn Asp Ser Ala Thr Lys Gly He Gin Gin He 

85 90 95 

Glu He Lys Gin Asn Glu Ala He Gin Ala He Thr Ala Thr Gin Thr 

100 105 110 

Ser Ala Thr Gin Ala Val Thr Ala Glu Val Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

Val Met Asn He He Asn Leu Glu Lys Pro Lys Gly Val Val Val Gin 

15 10 15 

Phe Gly Gly Gin Thr Ala He Asn Leu Ala Asp Lys Leu Ala Lys His 

20 25 30 

Gly Val Lys He Leu Gly Thr Ser Leu Glu Asn Leu Asn Arg Ala Glu 

35 40 45 

Asp Arg Lys Glu Phe Glu Ala Leu Leu Arg Lys He Asn Val Pro Gin 

50 55 60 

Pro Gin Gly Lys Ser Ala Thr Ser Pro Glu Glu Ala Leu Ala Asn Ala 
65 70 75 80 
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Ala Glu He Gly Tyr Pro 
85 

Gly Arg Ala Met Glu He 
100 

Met Thr Gin Ala Val Lys 
115 

Arg Tyr Leu Thr Gly Lys 
130 

Glu Thr Val He He Pro 
145 150 
Val His Ser Gly Asp Ser 
165 

Glu Asp Glu Leu Ala Thr 
180 

Gly Leu Asn He He Gly 
195 

Asp Gly Val Tyr Cys Phe 
210 

Xaa His Ser Arg Ala 
225 

(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

Val He Phe Met Thr Asn Asn Lys Val Ala Leu Val Thr Gly Gly Ala 

15 10 15 

Gin Gly He Gly Phe Lys He Ala Glu Arg Leu Val Glu Asp Gly Phe 

20 25 30 

Lys Val Ala Val Val Asp Phe Asn Glu Glu Gly Ala Lys Ala Ala Ala 

35 40 45 

Leu Lys Leu Ser Ser Asp Gly Thr Lys Ala He Ala He Lys Ala Asp 

50 55 60 

Val Ser Asn Arg Asp Asp Val Phe Asn Ala Val Arg Gin Ala Thr Pro 
65 70 75 80 



val Val Val Arg. Pro 
90 

Val Asp Asn Asp Lys 
105 

Ala Ser Pro Glu His 
120 

Glu He Glu Val Asp 
135 

Gly He Met Glu His 
155 

He Ala Val Tyr Pro 
170 

Leu Glu Asp Tyr Thr 
185 

Leu He Asn He Gin 
200 

Arg Ser Lys Pro Thr 
215 



Ser Tyr Val Leu Glyo. 
95 

Glu Leu Glu Asn Tyr 
110 

Pro Val Leu Val Asp 
125 

Ala He Cys Asp Gly 
140 

He Glu Arg Ala Gly 
160 

Pro Gin Thr Leu Thr 
175 

He Lys Leu Ala Lys 
190 

Phe Val He Ala His 
205 

Val Leu Val Glu Arg 
220 
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. Ser Ala lie Trp Arg Phe Pro Cys His Gly.-. 

85 90 

(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

Val Arg lie His Tyr Val Asp He He Met Phe Lys Asp Val Val He 

15 10 15 

Trp Glu Val Cys Leu He Arg Leu Leu Val He Lys Val Leu Lys Arg 

20 25 30 

Met He Phe Ala Arg Ala Lys Phe Glu Lys Glu Tyr Gly Val Lys Leu 

35 40 45 

Asn Pro Lys Ala Gly Lys Asp Asn His Glu Met Val Glu Gly He His 

50 55 60 

Asp Gly Glu Val His Ser Leu Tyr Leu Tyr Gly Glu Asp Thr Gly He 
65 70 75 80 

Val Asp Ser Asn He Asn. Phe Val Gin Ala Ala Phe Glu Lys Leu Asp 

85 90 95 

Phe Met Val Val Gin Asp Glu Phe Leu Thr Phe Thr Ala Thr Tyr Ala 

100 105 110 

Asp Val Val Leu Pro Ala Ser Pro Ser Leu Glu Lys Asp Gly Thr Phe 

115 120 125 

Thr Asn Thr Glu Arg Arg He Gin Arg Leu Tyr Gin Ala Leu Glu Pro 

130 135 140 

Leu Gly Asp Ser Lys Pro Asp Trp Lys He Phe Gin Ala He Ala Asn 
145 150 155 160 

Arg Leu Gly He Trp He Gly lie Thr Ser He Leu Val Lys Leu Trp 

165 170 175 

He Glu Gly Arg Thr Leu Asn Thr Ser He Cys Trp Gly Lys Leu 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 52 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 123 amino acids 
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. . .(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

Val Pro lie Arg His Glu Arg Xaa Asp Lys Cys Glu Xaa lie Met Tyr 

15 10 15 

Ala Asp Asn Met Thr Asp Xaa Met Lys Tyr Ala lie Asp Glu Thr Gin 

20 25 30 

Arg Arg Arg Glu lie Gin Met Lys His Asn Glu Lys His Gly lie Thr 

35 40 45 

Pro Lys Thr lie Asn Lys Xaa Val His Asp Leu He Ser Ala Thr Val 

50 55 60 

Glu Asn Asp Glu Asn Xaa Asp Xaa Ala Gin Thr val He Pro Lys Lys 
65 70 75 80 

Met Pro Lys Arg Ala Arg Xaa Lys Thr He Asp Asn He Glu Lys Glu 

85 90 95 

Met Lys Gin Ala Ala Lys Asp Leu Asp Phe Glu Lys Ala Xaa Glu Leu 

100 105 no 

Arg Asp Met Leu Phe Glu Leu Lys Ala Xaa Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

Val Xaa Ala Asn Glu He Xaa Lys Xaa Lys He Asp Ala Asn Lys Asp 

15 10 15 

Val Asp Lys Gin Val Gin Ala Leu He Asp Glu He Asp Arg Asn Pro 

20 25 30 

Asn Leu Thr Asp Lys Glu Lys Gin Ala Leu Lys Arg Ser Tyr 
35 40 45 



382 



EP 0 841 394 A2 



- (2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

Val Ser Glu Phe Asn Glu lie lie lie Ala Thr Pro Ala Gin Trp lie 

15 10 15 

Ser His Thr Gin Asp He Leu Lys Lys Tyr Asn He Thr Asp Gin Arg 

20 25 30 

Val Lys Val Val Ala Gly Gly Xaa Asp Arg Asn Glu Thr He Met Asn 

35 40 45 

He He Asp His He Arg Asn Val Asn Gly He Asn Asn Asp Asp Val 

50 55 60 

He Val Thr His Asp Ala Val Arg Pro Phe Leu Thr Gin Arg He He 
65 70 75 80 

Lys Glu Asn He Glu Val Ala Xaa Xaa Tyr Gly Ala Val Asp Thr Val 

85 90 95 

He Glu Ala He Asp Thr He Val Met Ser Lys Asp Lys Gin Asn He 

100 105 HO 

His Ser He Pro Val Arg Asn Glu Met Tyr Gin Gly Gin Thr Pro Gin 

115 120 125 

Ser Phe Asn He Lys Leu Leu Gin Asp Ser Tyr Arg Ala Leu Ser Ser 

130 135 140 

Glu Pro Xaa Arg Asn Leu He Arg Cys Met 
145 150 

(2) INFORMATION FOR SEQ ID NO: 52 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 
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70 



20 



25 



35 



Val Leu Phe Thr Ser Asp Ser Arg Pro Thr Met Ser Gly Asn Phe Glu 

1 5 10 15 . 

Tyr He Tyr Asn Glu Met Leu Arg Gin Asn Leu Asp Lys Lys Tyr Asp 

20 25 30 

He His Thr Val Phe Lys Ala Asn lie Thr Asp Arg Arg Gly He He 

35 40 45 

Asp Lys Phe Arg Leu Pro Tyr Leu Leu Gly Lys Ala Asp Tyr He Phe 

50 55 so 

Val Asp Asp Phe His Pro Leu He Tyr Thr Val Arg Phe Arg Arg Ser 
65 70 75 go 

Gin Glu Val He Gin Val Trp His Ala Val Gly Ala Phe Lys Thr Val 

85 90 95 

Gly Phe Ser Arg Thr Gly Lys Lys Gly Gly Pro Phe He Asp Ser Leu 

100 105 no 

Asn His Arg Ser Ser Cys Gin Ser Leu Cys He He 
115 120 

(2) INFORMATION FOR SEQ ID NO: 530: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

3Q (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 



Val Xaa His Ala Val Gly Ala Phe Lys Thr Val Gly Phe Ser Arg Thr 
40 1 5 10 15 

Gly Xaa Lys Gly Gly Pro Phe lie Asp Ser Leu Lys Pro He Val Val 
20 25 30 

Thr Pro Lys Ala Tyr Val Ser Ser Glu Thr Arg Tyr Ser He Leu Cys 
45 35 40 45 



(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

Val Tyr Phe Asp Leu Gly Lys Thr Ser Gly Ser Gly Thr Asn Ala Asn 

15 10 15 

Lys Val Thr lie Thr Lys lie Met Gly Trp Lys 



(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

Val Glu He Met Lys He Thr Val Asn Asp Lys Asn Glu Val He Gly 

15 10 15 

Tyr Val Asn Thr Gly Gly Leu Arg Asn Ser Leu Asp Val Asp Asp Asn 

20 25 30 

Asn Val Pro He Lys Phe Lys Glu Glu Phe Glu Pro Arg Lys Phe Val 

35 40 45 

Phe Thr Asn Gly Glu He Lys Tyr Asn Ser Asn Phe Glu Lys Glu Asp 

50 55 60 

Val Pro Asn Ala Ser Ser Gin Gin Ser Glu Ser Asp Leu Ser Asp Glu 
65 70 75 80 

Glu Leu Arg Gly Met Val Ala Ser Met Gin Met Gin Val Ala Gin Val 

85 90 95 

Asn Val Leu Thr Met Glu Leu Ala Gin Gin Asn Ala Met Leu Thr Gin 

100 105 110 

Gin Leu Thr Glu Leu Lys Thr Asn Lys Thr Ser Thr Glu Gly Asp Val 



20 



25 



115 



120 



125 



(2) INFORMATION FOR SEQ ID 



NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 

(B) TYPE: amino acid 
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. . (C) STRANDEDNESS : single ... t 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 533: 

Val Arg Gin Leu Ala Gin Ala Lys Lys Lys Ser Thr Ala Lys Lys Lys 

15 10 15 

Thr Ala Ser Lys Lys Arg Thr Asn Ser Arg Lys Lys Lys Asn Asp Asn 

20 25 30 

Pro lie Arg Tyr Val lie Ala He Leu Val Val Val Leu Met Val Leu 

35 40 45 

Gly Val Phe Gin Leu Gly He He Gly Arg Leu lie Asp Ser Phe Phe 

50 55 60 

Asn Tyr Leu Phe Gly Tyr Ser Arg Tyr Leu Thr Tyr He Leu Val Leu 
65 70 75 80 

Leu Ala Thr Gly Phe He Thr Tyr Ser Lys Arg He Pro Lys Thr Arg 

85 90 95 

Arg Thr Ala Gly Ser He Val Leu Gin He Ala Leu Leu Phe Val Ser 

100 105 110 

Gin Leu Val Phe His Phe Asn Ser Gly He Lys Ala Glu Arg Glu Pro 

115 120 125 

Val Leu Ser Tyr Val Tyr Gin Ser Tyr Gin His Ser His Phe Pro Asn 

130 135 140 

Phe Gly Gly Gly Val Leu Gly Phe Tyr Leu Leu Glu Leu Ser Val Pro 
145 150 155 160 

Leu He Ser Leu Phe Gly Val Cys He He Thr He Leu Leu Leu Cys 

165 170 175 

Ser Ser Val He Leu Leu Thr Asn His Gin His Arg Asp Val Ala Lys 

180 185 190 

Val Ala Leu Glu Asn He Lys Ala Trp Phe Gly Ser Phe Asn Glu Lys 

195 200 205 

Met Ser Glu Arg Asn Gin Glu Lys Gin Leu Lys Arg Glu Glu Lys Ala 

210 215 220 

Arg Leu Lys Glu Glu Gin Lys Ala Arg Gin Asn Glu Gin Pro Gin He 
225 230 235 240 

Lys Asp Val Ser Asp Phe Thr Glu Val Pro Gin Glu Arg Asp He Pro 

245 250 255 

He Tyr Gly His Thr Glu Asn Glu Ser Lys Ser Gin Cys Gin Pro Ser 

260 265 270 

Arg Lys Lys Arg Val Phe Asp Ala Glu Asn Ser Ser Asn Asn He Val 
275 280 285 
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Asn His Gin Ala Asp Gin Gin Glu Gin Leu. Thr Glu Gin Thr His Asn* 

290 295 300 

Ser Val Glu Ser Glu Asn Thr lie Glu Glu Ala Gly Glu Val Thr Asn 

305 310 315 320 

Val Ser Tyr Val Val Pro Pro Leu Thr Leu Leu Asn Gin Pro Ala Lys 

325 330 335 

Gin Lys Ala Thr Ser Lys Ala Glu Val Gin Arg Lys Gly Gin Val Pro 

340 345 350 

Lys Arg Tyr Xaa Lys Arg Phe Trp Gly Xaa lie Xaa Lys Xaa Thr Gin 

355 360 365 

Xaa Lys Leu Ser Xaa Ser Asn Ser lie 
370 375 



(2) INFORMATION FOR SEQ ID NO: 534: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 



Val Met Thr Glu Glu Xaa Cys Leu Leu Lys Arg Val Phe Met Arg Met 

15 10 15 

Lys Arg Phe Leu Thr lie Val Gin lie Leu Leu Val Val lie He He 

20 25 30 

He Phe Gly Tyr Lys He Val Gin Thr Tyr He Glu Asp Lys Gin Glu 

35 40 45 

Arg Ala Xaa Tyr Glu Lys Leu Gin Xaa Lys Phe Xaa Met Leu Met Xaa 

50 55 60 

Lys His Gin Glu His Val Arg Pro Gin Phe Glu Ser Leu Glu Lys He 
65 70 75 80 

Asn Lys Asp He Val Gly Trp He Lys Leu Ser Gly Thr Ser Leu Xaa 

85 90 95 

Tyr Pro Val Leu Gin Gly Lys Thr Asn His Asp Tyr Leu Asn Leu Asp 

100 105 110 

Phe Glu Arg Glu His Arg Arg Lys Gly Ser He Phe Met Asp Phe Arg 

115 120 125 

Asn Glu Leu Xaa Asn Leu Asn His Asn Thr He Leu Tyr Gly His His 
130 135 140 
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.Val Gly Asp Asn Thr Met Phe Asp Val Leu. Glu Asp Tyr Leu Lys Glnv. 
145 150 155 160 

Ser Phe Tyr Glu~tys His Lys lie He Gly Phe Asp Asn Lys Tyr Gly 

165 170 175 

Lys Tyr Gin Leu Gin Val Phe Ser Ala Tyr 
180 185 



Claims 

1. An isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of: 

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a polypeptide comprising an 
amino acid sequence of the Sequence Listing; 

(b) a polynucleotide having at least a 70% identity to a polynucleotide encoding a mature polypeptide ex- 
pressed by the gene contained in the S. aureus of the deposited strain that was sequenced to obtain a poly- 
nucleotide sequence of the Sequence Listing; 

(c) a polynucleotide encoding a polypeptide comprising an amino acid sequence which is at least 70% identical 
to an amino acid sequence of the Sequence Listing; 

(d) a polynucleotide which is complementary to the polynucleotide of (a), (b) or (c); and 

(e) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide of (a), (b), (c) or (d). 

2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence selected from the group consisting of the 
nucleic acid sequences set forth in the Sequence Listing. 

5. The polynucleotide of Claim 2 which encodes a polypeptide comprising an amino acid sequence sequence selected 
from the group consisting of the amino acid sequences set forth in the Sequence Listing. 

6. A vector comprising the polynucleotide of Claim 1 . 

7. A host cell comprising the vector of Claim 6. 

8. A process for producing a polypeptide comprising: expressing from the host cell of Claim 7 a polypeptide encoded 
by said DNA. 

9. A process for producing a polypeptide or fragment comprising culturing a host of claim 7 under conditions sufficient 
for the production of said polypeptide or fragment. 

10. A polypeptide comprising an amino acid sequence which is at least 70% identical to an amino acid sequence 
selected from the group consisting of the amino acid sequences set forth in the Sequence Listing. 

11. A polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid sequences 
set forth in the Sequence Listing. 

12. An antibody against the polypeptide of claim 10. 

13. An antagonist or agonist of the activity or expression of the polypeptide of claim 10. 

14. A method for the treatment or prevention of disease of an individual comprising: administering to the individual a 
therapeutically effective amount of the polypeptide of claim 10. 
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15. A method for the treatment of an individual having need to inhibit a bacterial polypeptide comprising: administering 
to the individual a therapeutically effective amount of the antagonist of Claim 1 3. 

16. A process for diagnosing a disease related to expression or activity of the polypeptide of claim 10 in an individual 
5 comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

(b) analyzing for the presence or amount of said polypeptide in a sample derived from the individual. 

io 1 7. A method for identifying compounds which interact with and inhibit or activate an activity of the polypeptide of claim 
10 comprising: 

contacting a composition comprising the polypeptide with the compound to be screened under conditions to 
permit interaction between the compound and the polypeptide to assess the interaction of a compound, such 
'5 interaction being associated with a second component capable of providing a detectable signal in response 

to the interaction of the polypeptide with the compound; 

and determining whether the compound interacts with and activates or inhibits an activity of the polypeptide 
by detecting the presence or absence of a signal generated from the interaction of the compound with the 
polypeptide. 

20 

18. A method for inducing an immunological response in a mammal which comprises inoculating the mammal with 
the polypeptide of claim 10, or a fragment or variant thereof, adequate to produce antibody and/or T cell immune 
response to protect said animal from disease. 

25 19. A method of inducing immunological response in a mammal which comprises delivering a nucleic acid vector to 
direct expression of a polypeptide of claim 10, or fragment or a variant thereof, for expressing said polypeptide, 
or a fragment or a variant thereof in vivo in order to induce an immunological response to produce antibody and/ 
or T cell immune response to protect said animal from disease. 

30 20. A polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 , 52, 
95, 184, 127, 128, 153, 212, 215 and 227. 

21. A polypeptide comprising a polypeptide encoded by the polynculeotide of claim 20. 

35 22. The isolated polynucleotide of claim 1 wherein said nucleotide is selected from the group consisting of: 

(a) a polynucleotide having at least a 90% identity to a polynucleotide encoding a polypeptide comprising the 
amino acid sequence of the Sequence Listing; 

(b) a polynucleotide having at least a 90% identity to a polynucleotide encoding the same mature polypeptide 
*o expressed by the gene contained in the S. aureus of the deposited strain that was sequenced to obtain a 

polynucleotide sequence of the Sequence Listing; 

(c) a polynucleotide encoding a polypeptide comprising an amino acid sequence which is at least 90% identical 
to the amino acid sequence of the Sequence Listing; 

(d) a polynucleotide which is complementary to the polynucleotide of (a), (b) or (c); and 

4 5 (e) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide of (a), (b), (c) or (d). 

23. The isolated polynucleotide of claim 1 selected from the group consisting of: 

(a) a polynucleotide having at least a 95% identity to a polynucleotide encoding a polypeptide comprising the 
50 amino acid sequence of the Sequence Listing; 

(b) a polynucleotide having at least a 95% identity to a polynucleotide encoding the same mature polypeptide 
expressed by the gene contained in the S. aureus of the deposited strain that was sequenced to obtain a 
polynucleotide sequence of the Sequence Listing; 

(c) a polynucleotide encoding a polypeptide comprising an amino acid sequence which is at least 95% identical 
55 to the amino acid sequence of the Sequence Listing; 

(d) a polynucleotide which is complementary to the polynucleotide of (a), (b) or (c); and 

(e) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide of (a), (b), (c) or (d). 



389 



EP 0 841 394 A2 



24. An isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting ot: 

(a) a polynucleotide having at least a 50% identity to a polynucleotide encoding a polypeptide comprising the 
amino acid sequence of the Sequence Listing and obtained from a prokaryotic species other than S. aureus, 
s (b) a polynucleotide encoding a polypeptide-comprising an amino acid sequence which is at least 50% identical 

to the amino acid sequence of the Sequence Listing and obtained from a prokaryotic species other than S. 
aureus; and 

(c) a polynucleotide which is complementary to the polynucleotide of (a) or (b). 

10 25. An isolated Streptococcal polypeptide having one of the amino acid sequences given in Table 1 . 

26. An isolated nucleic acid encoding one of the amino acid sequences of Claim 1 and nucleic acid sequences capable 
of hybridizing therewith under stringent conditions. 

15 27. Recombinant vectors comprising the n ucleic acid sequences of Claim 26 and host cells transformed or transf ected 
therewith. 

28. A method of identifying an antimicrobial compound comprising contacting candidate compounds with a polypeptide 
of Claim 1 and selecting those compounds capable of inhibiting the bioactivity of said polypeptide. 



20 



30 



35 



45 



29. Antimicrobial compounds identified by the method of Claim 28. 
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